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TITLE OF INVENTION 

CHLAMYDIA ANTIGENS AND CORRESPONDING DNA FRAGMENTS AND USES 

THEREOF 
FIELD OF INVENTION 
The present invention relates to Chlamydia antigens and corresponding DNA 
molecules, which can be used in methods to prevent and treat Chlamydia infection in mammals, 
such as humans. 

BACKGROUND OF THE INVENTION 

Chlamydiae are prokaiyotes. They exhibit morphologic and structural similarities to 
gram-negative bacteria including a trilaminar outer membrane, which contains 
lipopolysaccharide and several membrane proteins that are structurally and functionally 
analogous to proteins found in E coli. They are obligate intra-cellular parasites with a unique 
biphasic life cycle consisting of a metabolically inactive but infectious extracellular stage and a 
replicating but non-infectious intracellular stage. The replicative stage of the life-cycle takes 
place within a membrane-bound inclusion which sequesters the bacteria away from the 
cytoplasm of the infected host cell. 

C pneumoniae is a common human pathogen, originally described as the TWAR strain 
of Chlamydia psittaci but subsequently recognised to be a new species. C. pneumoniae is 
antigenically, genetically and morphologically distinct from other chlamydia species (C. 
trachomatis, C. pecorum and C psittaci). It shows 10% or less DNA sequence homology with 
either of C trachomatis or Cpsittaci and so far appears to consist of only a single strain, TWAR. 

C pneumoniae is a common cause of community acquired pneumonia, only less frequent 
than Streptococcus pneumoniae and Mycoplasma pneumoniae (Ref 1 ,2). It can also cause upper 
respiratory tract symptoms and disease, including bronchitis and sinusitis (Ref 1,3,4,5). The 
great majority of the adult population (over 60%) has antibodies to C. pneumoniae (Ref 5), 
indicating past infection which was unrecognized or asymptomatic. 

Of considerable importance is the association of atherosclerosis and C pneumoniae 
infection. There are several epidemiological studies showing a correlation of previous infections 
with C pneumoniae and heart attacks, coronary artery and carotid artery disease (Ref 6-10). 
Moreover, the organisms has been detected in atheromas and fatty streaks of the coronary, 
carotid, peripheral arteries and aorta (Ref 11-15). Viable C pneumoniae has been recovered 
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from the coronary and carotid artery (Ref 16,17). Furthermore, it has been shown that C. 
pneumoniae can induce changes of atherosclerosis in a rabbit model (Ref 18). Taken together, 
these results indicate that it is highly probable that C. pneumoniae can cause atherosclerosis in 
humans, though the epidemiological importance of chlamydial atherosclerosis remains to be 
demonstrated. 

A number of recent studies have also indicated an association between C. pneumoniae 
infection and asthma. Infection has been linked to wheezing, asthmatic bronchitis, adult-onset 
asthma and acute exacerbations of asthma in adults, and small-scale studies have shown that 
prolonged antibiotic treatment was effective at greatly reducing the severity of the disease in 
some individuals (Ref 19-24). 

In light of these results a protective vaccine against C pneumoniae infection would be of 
considerable importance. There is not yet an effective vaccine for any human chlamydial 
infection. Nevertheless, studies with G trachomatis and C psittaci indicate that this is an 
attainable goal. For example, mice which have recovered from a lung infection with C 
trachomatis are protected from infertility induced by a subsequent vaginal challenge (Ref 25). 
Similarly, sheep immunized with inactivated C psittaci were protected from subsequent 
chlamydial-induced abortions and stillbirths (Ref 26). Protection from chlamydial infections has 
been associated with Thl immune responses, particularly the induction of INFg - producing 
CD4+T-cells (Ref 27). The adoptive transfer of CD4+ cell lines or clones to nude or SCID mice 
conferred protection from challenge or cleared chronic disease (Ref 28,29), and in vivo 
depletion of CD4+ T cells exacerbated disease post-challenge (Ref 30,31). However, the 
presence of sufficiently high litres of neutralising antibody at mucosal surfaces can also exert a 
protective effect (Ref 32). 

The extent of antigenic variation within the species C pneumoniae is not well 
characterised. Serovars of C. trachomatis are defined on the basis of antigenic variation in 
MOMP, but published C. pneumoniae MOMP gene sequences show no variation between 
several diverse isolates of the organism (Ref 33-35). Regions of the protein known to be 
conserved in other chlamydial MOMPs are conserved in C pneumoniae (Ref 33,34). One study 
has described a strain of C. pneumoniae with a MOMP of greater that usual molecular weight, 
but the gene for this has not been sequenced (Ref 1). Partial sequences of outer membrane 
protein 2 from nine diverse isolates were also found to be invariant (Ref 16). The genes for 
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HSP60 and HSP70 show little variation from other chlamydial species, as would be expected. 
The gene encoding a 76kDa antigen has been cloned from a single strain of C. pneumoniae. It 
has no significant similarity with other known chlamydial genes (Ref 4). 

Many antigens recognised by immune sera to C pneumoniae are conserved across all 
5 chlamydiae, but 98kDa, 76 kDa and 54 kDa proteins may be C. pneumoniae-specific (Ref 2, 4, 
36). Immunoblotting of isolates with sera from patients does show variation of blotting patterns 
between isolates, indicating that serotypes C pneumoniae may exist (Ref 1,16). However, the 
results are potentially confounded by the infection status of the patients, since immunoblot 
profiles of a patient's sera change with time post-infection. An assessment of the number and 
10 relative frequency of any serotypes, and the defining antigens, is not yet possible. 

C. pneumoniae infection usually presents as an acute respiratory disease (i.e., cough, sore 
throat, hoarseness, and fever; abnormal chest sounds on auscultation). For most patients, the 
01 cough persists for 2 to 6 weeks, and recovery is slow. In approximately 10% of these cases, 

o 

upper respiratory tract infection is followed by bronchitis or pneumonia. Furthermore, during a 
^15 C pneumoniae epidemic, subsequent co-infection with pneumococcus has been noted in about 
=F half of these pneumonia patients, particularly in the infirm and the elderly. As noted above, 
fQ there is more and more evidence that C. pneumoniae infection is also linked to diseases other 
j\ than respiratory infections. 

i — * 

jy The reservoir for the organism is presumably people. In contrast to C. psittaci infections, 

Mao there is no known bird or animal reservoir. Transmission has not been clearly defined. It may 
g result from direct contact with secretions, from formites, or from airborne spread. There is a long 
incubation period, which may last for many months. Based on analysis of epidemics, C. 
pneumoniae appears to spread slowly through a population (case-to-case interval averaging 30 
days) because infected persons are inefficient transmitters of the organism. Susceptibility to C. 
25 pneumoniae is universal. Reinfections occur during adulthood, following the primary infection as 
a child. C. pneumoniae appears to be an endemic disease throughout the world, noteworthy for 
superimposed intervals of increased incidence (epidemics) that persist for 2 to 3 years. 
C. trachomatis infection does not confer cross-immunity to C pneumoniae. Infections are easily 
treated with oral antibiotics, tetracycline or erythromycin (2 g/d, for at least 10 to 14 d). A 
30 recently developed drug, azithromycin, is highly effective as a single-dose therapy against 
chlamydial infections. 
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In most instances, C pneumoniae infection is often mild and without complications, and 
up to 90% of infections are subacute or unrecognized. Among children in industrialized 
countries, infections have been thought to be rare up to the age of 5 y a although a recent study (E 
Normann et al, Chlamydia pneumoniae in children with acute respiratory tract infections, Acta 
5 Paediatrica, 1998, Vol 87, Iss 1, pp 23-27) has reported that many children in this age group 
show PCR evidence of infection despite being seronegative, and estimates a prevalence of 17- 
19% in 2-4 y olds. In developing countries, the seroprevalence of C pneumoniae antibodies 
among young children is elevated, and there are suspicions that C. pneumoniae may be an 
important cause of acute lower respiratory tract disease and mortality for infants and children in 
1 0 tropical regions of the world. 

From seroprevalence studies and studies of local epidemics, the initial C. pneumoniae 
infection usually happens between the ages of 5 and 20 y. In the USA, for example, there are 
01 estimated to be 30,000 cases of childhood pneumonia each year caused by C. pneumoniae. 
^ Infections may cluster among groups of children or young adults (e.g., school pupils or military 
15 conscripts). 

C. pneumoniae causes 10 to 25% of community-acquired lower respiratory tract infections 
g (as reported from Sweden, Italy, Finland, and the USA). During an epidemic, C. pneumonia 
J\ infection may account for 50 to 60% of the cases of pneumonia. During these periods, also, more 

5f episodes of mixed infections with S. pneumoniae have been reported. 

3 

^20 Reinfection during adulthood is common; the clinical presentation tends to be milder. Based on 
population seroprevalence studies, there tends to be increased exposure with age, which is 
particularly evident among men. Some investigators have speculated that a persistent, 
asymptomatic C pneumoniae infection state is common. 

In adults of middle age or older, C. pneumoniae infection may progress to chronic 
25 bronchitis and sinusitis. A study in the USA revealed that the incidence of pneumonia caused by 
C pneumoniae in persons younger than 60 years is 1 case per 1,000 persons per year; but in the 
elderly, the disease incidence rose three-fold. C pneumoniae infection rarely leads to 
hospitalization, except in patients with an underlying illness. 

SUMMARY OF THE INVENTION 
30 The present invention provides purified and isolated DNA molecules that encode 

Chlamydia polypeptides designated CPN1 00639 (SEQ ID No: 1,2), which can be used in 
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methods to prevent, treat, and diagnose Chlamydia infection. The encoded polypeptides include 
polypeptides having the amino acid sequence shown in SEQ ID No:3 . Those skilled in the art 
will appreciate that the invention also includes DNA molecules that encode mutants and 
derivatives of such polypeptides, which result from the addition, deletion, or substitution of non- 
essential amino acids as described herein. The invention also includes RNA molecules 
corresponding to the DNA molecules of the invention. 

In addition to the DNA and RNA molecules, the invention includes the corresponding 
polypeptides and monospecific antibodies that specifically bind to such polypeptides. 

The present invention has wide application and includes expression cassettes, vectors, 
and cells transformed or transfected with the polynucleotides of the invention. Accordingly, the 
present invention provides (i)a method for producing a polypeptide of the invention in a 
recombinant host system and related expression cassettes, vectors, and transformed or transfected 
cells; (ii) a live vaccine vector, such as a pox virus, Salmonella typhimurium, or Vibrio cholerae 
vector, containing a polynucleotide of the invention, such vaccine vectors being useful for, e.g., 
preventing and treating Chlamydia infection, in combination with a diluent or carrier, and related 
pharmaceutical compositions and associated therapeutic and/or prophylactic methods; (iii) a 
therapeutic and/or prophylactic method involving administration of an RNA or DNA molecule of 
the invention, either in a naked form or formulated with a delivery vehicle, a polypeptide or 
combination of polypeptides, or a monospecific antibody of the invention, and related 
pharmaceutical compositions; (iv) a method for diagnosing the presence of Chlamydia in a 
biological sample, which can involve the use of a DNA or RNA molecule, a monospecific 
antibody, or a polypeptide of the invention; and (v) a method for purifying a polypeptide of the 
invention by antibody-based affinity chromatography. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The present invention will be further understood from the following description with 
reference to the drawings, in which: 

Figure 1 shows the nucleotide sequence of the CPN1 00639 (SEQ ID No: 1 - entire 
sequence and SEQ ID No: 2 — coding sequence) and the deduced amino acid sequence of the 
CPN1 00639 protein from Chlamydia pneumoniae (SEQ ID No: 3 ). 

Figure 2 shows the restriction enzyme analysis of the gene encoding the C. 
pneumoniae CPN100639 gene. 
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DETAILED DESCRIPTION OF INVENTION 

In the C. pneumoniae genome, open reading frames (ORFs) encoding chlamydial 
polypeptides have been identified. These polypeptides include polypeptides permanently found 
in the bacterial membrane structure, polypeptides that are present in the external vicinity of the 
bacterial membrane, include polypeptides permanently found in the inclusion membrane 
structure, polypeptides that are present in the external vicinity of the inclusion membrane, and 
polypeptides that are released into the cytoplasm of the infected cell. These polypeptides can be 
used in vaccination methods for preventing and treating Chlamydia infection. 

According to a first aspect of the invention, there are provided isolated 
polynucleotides encoding the precursor and mature forms of Chlamydia polypeptides. 

An isolated polynucleotide of the invention encodes (i) a polypeptide having an 
amino acid sequence that is homologous to a Chlamydia amino acid, the Chlamydia amino acid 
sequence being selected from the group consisting of: 

(a) the amino acid sequences as shown: (SEQ ID No: 3 ) 

The term "isolated polynucleotide" is defined as a polynucleotide removed from the 
environment in which it naturally occurs. For example, a naturally-occurring DNA molecule 
present in the genome of a living bacteria or as part of a gene bank is not isolated, but the same 
molecule separated from the remaining part of the bacterial genome, as a result of, e.g., a cloning 
event (amplification), is isolated. Typically, an isolated DNA molecule is free from DNA 
regions (e.g., coding regions) with which it is immediately contiguous at the 5' or 3* end, in the 
naturally occurring genome. Such isolated polynucleotides could be part of a vector or a 
composition and still be isolated in that such a vector or composition is not part of its natural 
environment. 

A polynucleotide of the invention can be in the form of RNA or DNA (e.g., cDNA, 
genomic DNA, or synthetic DNA), or modifications or combinations thereof. The DNA can be 
double-stranded or single-stranded, and, if single-stranded, can be the coding strand or the non- 
coding (anti-sense) strand. The sequence that encodes a polypeptide of the invention as shown in 
SEQ ID NOs: 1 and 2, can be (a) the coding sequence as shown in SEQ ID NOs:2 (b) a 
ribonucleotide sequence derived by transcription of (a) ; or (c) a different coding sequence; this 
latter, as a result of the redundancy or degeneracy of the genetic code, encodes the same 
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polypeptides as the DNA molecules of which the nucleotide sequences are illustrated in SEQ ID 
NOsrl to 2. 

By "polypeptide" or "protein" is meant any chain of amino acids, regardless of length 
or post-translational modification (e.g., glycosylation or phosphorylation). Both terms are used 
interchangeably in the present application. 

By "homologous amino acid sequence" is meant an amino acid sequence that differs 
from an amino acid sequence shown in SEQ ID No: 3 , only by one or more conservative amino 
acid substitutions, or by one or more non-conservative amino acid substitutions, deletions, or 
additions located at positions at which they do not destroy the specific antigenicity of the 
polypeptide. 

Preferably, such a sequence is at least 75%, more preferably 80%, and most 
preferably 90% identical to an amino acid sequence shown in SEQ ID No: 3 . 

Homologous amino acid sequences include sequences that are identical or 
substantially identical to an amino acid sequence as shown in SEQ ID No:3 . By "amino acid 
sequence substantially identical" is meant a sequence that is at least 90%, preferably 95%, more 
preferably 97%, and most preferably 99% identical to an amino acid sequence of reference and 
that preferably differs from the sequence of reference, if at all, by a majority of conservative 
amino acid substitutions. 

Conservative amino acid substitutions typically include substitutions among amino 
acids of the same class. These classes include, for example, amino acids having uncharged polar 
side chains, such as asparagine, glutamine, serine, threonine, and tyrosine; amino acids having 
basic side chains, such as lysine, arginine, and histidine; amino acids having acidic side chains, 
such as aspartic acid and glutamic acid; and amino acids having nonpolar side chains, such as 
glycine, alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan, and 
cysteine. 

Homology is typically measured using sequence analysis software (e.g., Sequence 
Analysis Software Package of the Genetics Computer Group, University of Wisconsin 
Biotechnology Center, 1710 University Avenue, Madison, WI 53705). Similar amino acid 
sequences are aligned to obtain the maximum degree of homology (/.<?., identity). To this end, it 
may be necessary to artificially introduce gaps into the sequence. Once the optimal alignment 
has been set up, the degree of homology (i.e., identity) is established by recording all of the 
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positions in which the amino acids of both sequences are identical, relative to the total number of 
positions. 

Homologous polynucleotide sequences are defined in a similar way. Preferably, a 
homologous sequence is one that is at least 45%, more preferably 60%, and most preferably 85% 
identical to (i) a coding sequence of SEQ ID NOs:l and 2. 

Polypeptides having a sequence homologous to one of the sequences shown in SEQ 
ID NO: 3 , include naturally-occurring allelic variants, as well as mutants or any other non- 
naturally occurring variants that are analogous in terms of antigenicity, to a polypeptide having a 
sequence as shown in SEQ ID NO: 3 . 

As is known in the art, an allelic variant is an alternate form of a polypeptide that is 
characterized as having a substitution, deletion, or addition of one or more amino acids that does 
not alter the biological function of the polypeptide. By "biological function" is meant the 
function of the polypeptide in the cells in which it naturally occurs, even if the function is not 
necessary for the growth or survival of the cells. For example, the biological function of a porin 
is to allow the entry into cells of compounds present in the extracellular medium. The biological 
function is distinct from the antigenic function. A polypeptide can have more than one 
biological function. 

Allelic variants are very common in nature. For example, a bacterial species, e.g.,C. 
pneumoniae, is usually represented by a variety of strains that differ from each other by minor 
allelic variations. Indeed, a polypeptide that fulfills the same biological function in different 
strains can have an amino acid sequence that is not identical in each of the strains. Such an 
allelic variation may be equally reflected at the polynucleotide level. 

Support for the use of allelic variants of polypeptide antigens comes from, e.g., 
studies of the Chlamydial MOMP antigen. The amino acid sequence of the MOMP varies from 
strain to strain, yet cross-strain antibody binding plus neutralization of infectivity occurs, 
indicating that the MOMP, when used as an immunogen, is tolerant of amino acid variations. 

Polynucleotides, e.g., DNA molecules, encoding allelic variants can easily be 
retrieved by polymerase chain reaction (PCR) amplification of genomic bacterial DNA extracted 
by conventional methods. This involves the use of synthetic oligonucleotide primers matching 
upstream and downstream of the 5 f and 3' ends of the encoding domain. Suitable primers can be 
designed according to the nucleotide sequence information provided in SEQ ID NOs:l and 2. 
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Typically, a primer can consist of 10 to 40, preferably 15 to 25 nucleotides. It may be also 
advantageous to select primers containing C and G nucleotides in a proportion sufficient to 
ensure efficient hybridization; e.g., an amount of C and G nucleotides of at least 40%, preferably 
50% of the total nucleotide amount 

Useful homologs that do not naturally occur can be designed using known methods 
for identifying regions of an antigen that are likely to be tolerant of amino acid sequence changes 
and/or deletions. For example, sequences of the antigen from different species can be compared 
to identify conserved sequences. 

Polypeptide derivatives that are encoded by polynucleotides of the invention include, 
e.g., fragments, polypeptides having large internal deletions derived from full-length 
polypeptides, and fusion proteins. 

Polypeptide fragments of the invention can be derived from a polypeptide having a 
sequence homologous to any of the sequences shown in SEQ ID NO: 3 , to the extent that the 
fragments retain the substantial antigenicity of the parent polypeptide (specific antigenicity). 
Polypeptide derivatives can also be constructed by large internal deletions that remove a 
substantial part of the parent polypeptide, while retaining specific antigenicity. Generally, 
polypeptide derivatives should be about at least 12 amino acids in length to maintain 
antigenicity. Advantageously, they can be at least 20 amino acids, preferably at least 50 amino 
acids, more preferably at least 75 amino acids, and most preferably at least 100 amino acids in 
length- 
Useful polypeptide derivatives, e.g., polypeptide fragments, can be designed using 
computer-assisted analysis of amino acid sequences in order to identify sites in protein antigens 
having potential as surface-exposed, antigenic regions (Ref 37). 

Polypeptide fragments and polypeptides having large internal deletions can be used 
for revealing epitopes that are otherwise masked in the parent polypeptide and that may be of 
importance for inducing a protective T cell-dependent immune response. Deletions can also 
remove immunodominant regions of high variability among strains. 

It is an accepted practice in the field of immunology to use fragments and variants of 
protein immunogens as vaccines, as all that is required to induce an immune response to a 
protein is a small {e.g., 8 to 10 amino acid) immunogenic region of the protein. This has been 
done for a number of vaccines against pathogens other than Chlamydia. For example, short 
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synthetic peptides corresponding to surface-exposed antigens of pathogens such as murine 
mammary tumor virus, peptide containing 11 amino acids; (Ref 38), Semliki Forest virus, 
peptide containing 16 amino acids (Ref 39), and canine parvovirus, 2 overlapping peptides, each 
containing 15 amino acids (Ref 40), have been shown to be effective vaccine antigens against 
their respective pathogens. 

Polynucleotides encoding polypeptide fragments and polypeptides having large 
internal deletions can be constructed using standard methods (Ref 41), for example, by PCR, 
including inverse PCR, by restriction enzyme treatment of the cloned DNA molecules, or by the 
method of Kunkel et ah (Ref 42) biological material available at Stratagene. 

A polypeptide derivative can also be produced as a fusion polypeptide that contains a 
polypeptide or a polypeptide derivative of the invention fused, e.g. 9 at the N- or C-terminal end, 
to any other polypeptide (hereinafter referred to as a peptide tail). Such a product can be easily 
obtained by translation of a genetic fusion, i.e., a hybrid gene. Vectors for expressing fusion 
polypeptides are commercially available, such as the pMal-c2 or pMal-p2 systems of New 
England Biolabs, in which the peptide tail is a maltose binding protein, the glutathione-S- 
transferase system of Pharmacia, or the His-Tag system available from Novagen. These and 
other expression systems provide convenient means for further purification of polypeptides and 
derivatives of the invention. 

Another particular example of fusion polypeptides included in invention includes a 
polypeptide or polypeptide derivative of the invention fused to a polypeptide having adjuvant 
activity, such as, e.g. y subunit B of either cholera toxin or E. coli heat-labile toxin. Several 
possibilities are can be used for achieving fusion. First, the polypeptide of the invention can be 
fused to the N-, or preferably, to the C-terminal end of the polypeptide having adjuvant activity. 
Second, a polypeptide fragment of the invention can be fused within the amino acid sequence of 
the polypeptide having adjuvant activity. 

As stated above, the polynucleotides of the invention encode Chlamydia polypeptides 
in precursor or mature form. They can also encode hybrid precursors containing heterologous 
signal peptides, which can mature into polypeptides of the invention. By "heterologous signal 
peptide" is meant a signal peptide that is not found in the naturally-occurring precursor of a 
polypeptide of the invention. 
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A polynucleotide of the invention, having a homologous coding sequence, hybridizes, 
preferably under stringent conditions, to a polynucleotide having a sequence as shown in SEQ ID 
NOs:l to 2. Hybridization procedures are, e.g., described in Ausubel et aL 9 (Ref 41), Silhavy et 
ah (Ref 43); Davis et al. (ref 44). Important parameters that can be considered for optimizing 
hybridization conditions are reflected in a formula that allows calculation of a critical value, the 
melting temperature above which two complementary DNA strands separate from each other Ref 
45). This formula is as follows: Tm = 81.5 + 0.5 x (% G+C) + 1.6 log (positive ion 
concentration) - 0.6 x (% formamide). Under appropriate stringency conditions, hybridization 
temperature (Th) is approximately 20 to 40°C, 20 to 25°C, or, preferably 30 to 40°C below the 
calculated Tm. Those skilled in the art will understand that optimal temperature and salt 
conditions can be readily determined empirically in preliminary experiments using conventional 
procedures. 

For example, stringent conditions can be achieved, both for pre-hybridizing and 
hybridizing incubations, (i) within 4-16 hours at 42°C, in 6 x SSC containing 50% formamide or 
(ii) within 4-16 hours at 65 °C in an aqueous 6 x SSC solution (1 M NaCl, 0.1 M sodium citrate 
(PH7.0)). 

For polynucleotides containing 30 to 600 nucleotides, the above formula is used and 
then is corrected by subtracting (600/polynucleotide size in base pairs). Stringency conditions 
are defined by a Th that is 5 to 1 0°C below Tm. 

Hybridization conditions with oligonucleotides shorter than 20-30 bases do not 
exactly follow the rules set forth above. In such cases, the formula for calculating the Tm is as 
follows: Tm = 4 x (G+C) + 2 (A+T). For example, an 18 nucleotide fragment of 50% G+C 
would have an approximate Tm of 54°C. 

A polynucleotide molecule of the invention, containing RNA, DNA, or modifications 
or combinations thereof, can have various applications. For example, a DNA molecule can be 
used (i) in a process for producing the encoded polypeptide in a recombinant host system, (ii) in 
the construction of vaccine vectors such as poxviruses, which are further used in methods and 
compositions for preventing and/or treating Chlamydia infection, (iii) as a vaccine agent (as well 
as an RNA molecule), in a naked form or formulated with a delivery vehicle and, (iv) in the 
construction of attenuated Chlamydia strains that can over-express a polynucleotide of the 
invention or express it in a non-toxic, mutated form. 

11 
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According to a second aspect of the invention, there is therefore provided (i) an 
expression cassette containing a DNA molecule of the invention placed under the control of the 
elements required for expression, in particular under the control of an appropriate promoter; (ii) 
an expression vector containing an expression cassette of the invention; (iii) a procaryotic or 
eucaryotic cell transformed or transfected with an expression cassette and/or vector of the 
invention, as well as (iv) a process for producing a polypeptide or polypeptide derivative 
encoded by a polynucleotide of the invention, which involves culturing a procaryotic or 
eucaryotic cell transformed or transfected with an expression cassette and/or vector of the 
invention, under conditions that allow expression of the DNA molecule of the invention and, 
recovering the encoded polypeptide or polypeptide derivative from the cell culture. 

A recombinant expression system can be selected from procaryotic and eucaryotic 
hosts. Eucaryotic hosts include yeast cells (e.g., Saccharomyces cerevisiae or Pichia pastoris), 
mammalian cells (e.g., COS1, NIH3T3, or JEG3 cells), arthropods cells (e.g., Spodoptera 
frugiperda (SF9) cells), and plant cells. Preferably, a procaryotic host such as E. coli is used. 
Bacterial and eucaryotic cells are available from a number of different sources to those skilled in 
the art, e.g., the American Type Culture Collection (ATCC; Rockville, Maryland). 

The choice of the expression system depends on the features desired for the expressed 
polypeptide. For example, it may be useful to produce a polypeptide of the invention in a 
particular lipidated form or any other form. 

The choice of the expression cassette will depend on the host system selected as well 
as the features desired for the expressed polypeptide. Typically, an expression cassette includes 
a promoter that is functional in the selected host system and can be constitutive or inducible; a 
ribosome binding site; a start codon (ATG) if necessary, a region encoding a signal peptide, e.g., 
a lipidation signal peptide; a DNA molecule of the invention; a stop codon; and optionally a 3' 
terminal region (translation and/or transcription terminator). The signal peptide encoding region 
is adjacent to the polynucleotide of the invention and placed in proper reading frame. The signal 
peptide-encoding region can be homologous or heterologous to the DNA molecule encoding the 
mature polypeptide and can be specific to the secretion apparatus of the host used for expression. 
The open reading frame constituted by the DNA molecule of the invention, solely or together 
with the signal peptide, is placed under the control of the promoter so that transcription and 
translation occur in the host system. Promoters, signal peptide encoding regions are widely 
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known and available to those skilled in the art and includes, for example, the promoter of 
Salmonella typhimurium (and derivatives) that is inducible by arabinose (promoter araB) and is 
functional in Gram-negative bacteria such as E. colt (as described in U.S. Patent No. 5,028,530 
and in Cagnon et al 9 (Ref 46); the promoter of the gene of bacteriophage T7 encoding RNA 
polymerase, that is functional in a number of E coli strains expressing 17 polymerase (described 
in U.S. Patent No. 4,952,496); OspA lipidation signal peptide ; and RlpB lipidation signal 
peptide (Ref 47). 

The expression cassette is typically part of an expression vector, which is selected for 
its ability to replicate in the chosen expression system. Expression vectors {e.g., plasmids or 
viral vectors) can be chosen from those described in Pouwels et at. (Cloning Vectors: A 
Laboratory Manual 1985, Supp. 1987). They can be purchased from various commercial 
sources. 

Methods for transforming/transfecting host cells with expression vectors will depend 
on the host system selected as described in Ausubel et al. y (Ref 41). 

Upon expression, a recombinant polypeptide of the invention (or a polypeptide 
derivative) is produced and remains in the intracellular compartment, is secreted/excreted in the 
extracellular medium or in the peripiasmic space, or is embedded in the cellular membrane. The 
polypeptide can then be recovered in a substantially purified form from the cell extract or from 
the supernatant after centrifagation of the recombinant cell culture. Typically, the recombinant 
polypeptide can be purified by antibody-based affinity purification or by any other method that 
can be readily adapted by a person skilled in the art, such as by genetic fusion to a small affinity 
binding domain. Antibody-based affinity purification methods are also available for purifying a 
polypeptide of the invention extracted from a Chlamydia strain. Antibodies useful for purifying 
by immunoaffinity the polypeptides of the invention can be obtained as described below. 

A polynucleotide of the invention can also be useful in the vaccine field, e.g., for 
achieving DNA vaccination. There are two major possibilities, either using a viral or bacterial 
host as gene delivery vehicle Give vaccine vector) or administering the gene in a free form, e.g., 
inserted into a plasmid. Therapeutic or prophylactic efficacy of a polynucleotide of the invention 
can be evaluated as described below. 

Accordingly, in a third aspect of the invention, there is provided (i) a vaccine vector 
such as a poxvirus, containing a DNA molecule of the invention, placed under the control of 
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elements required for expression; (ii) a composition of matter containing a vaccine vector of the 
invention, together with a diluent or carrier; particularly, (iii) a pharmaceutical composition 
containing a therapeutically or prophylactically effective amount of a vaccine vector of the 
invention; (iv) a method for inducing an immune response against Chlamydia in a mammal (e.g., 
a human; alternatively, the method can be used in veterinary applications for treating or 
preventing Chlamydia infection of animals, e.g., cats or birds), which involves administering to 
the mammal an immunogenically effective amount of a vaccine vector of the invention to elicit 
an immune response, e.g., a protective or therapeutic immune response to Chlamydia ; and 
particularly, (v) a method for preventing and/or treating a Chlamydia {e.g., C trachomatis, C. 
psittaci, C. pneumonia, C pecorum) infection, which involves administering a prophylactic or 
therapeutic amount of a vaccine vector of the invention to an individual in need. Additionally, 
the third aspect of the invention encompasses the use of a vaccine vector of the invention in the 
preparation of a medicament for preventing and/or treating Chlamydia infection. 

A vaccine vector of the invention can express one or several polypeptides or 
derivatives of the invention, as well as at least one additional Chlamydia antigen, fragment, 
homolog, mutant, or derivative thereof. In addition, it can express a cytokine, such as 
interleukin-2 (IL-2) or interleukin-12 (IL-12), that enhances the immune response (adjuvant 
effect). Thus, a vaccine vector can include an additional DNA sequence encoding, e.g., a 
chlamydial antigen , or a cytokine, placed under the control of elements required for expression 
in a mammalian cell. 

Alternatively, a composition of the invention can include several vaccine vectors, 
each of them being capable of expressing a polypeptide or derivative of the invention. A 
composition can also contain a vaccine vector capable of expressing an additional Chlamydia 
antigen, or a subunit, fragment, homolog, mutant, or derivative thereof; or a cytokine such as IL- 
2 or IL-12. 

In vaccination methods for treating or preventing infection in a mammal, a vaccine 
vector of the invention can be administered by any conventional route in use in the vaccine field, 
particularly, to a mucosal (e.g., ocular, intranasal, oral, gastric, pulmonary, intestinal, rectal, 
vaginal, or urinary tract) surface or via the parenteral (e.g., subcutaneous, intradermal, 
intramuscular, intravenous, or intraperitoneal) route. Preferred routes depend upon the choice of 
the vaccine vector. The administration can be achieved in a single dose or repeated at intervals. 
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The appropriate dosage depends on various parameters understood by skilled artisans such as the 
vaccine vector itself, the route of administration or the condition of the mammal to be vaccinated 
(weight, age and the like). 

Live vaccine vectors available in the art include viral vectors such as adenoviruses 
and poxviruses as well as bacterial vectors, e.g., Shigella, Salmonella, Vibrio cholerae, 
Lactobacillus, Bacille bilie de Calmette-Guerin (BCG), and Streptococcus. 

An example of an adenovirus vector, as well as a method for constructing an 
adenovirus vector capable of expressing a DNA molecule of the invention, are described in 
U.S. Patent No. 4,920,209. Poxvirus vectors that can be used include, e.g., vaccinia and canary 
pox virus, described in U.S. Patent No. 4,722,848 and U.S. Patent No. 5,364,773, respectively 
(also see, e.g., Tartaglia et aL, Virology (1992) 188:217) for a description of a vaccinia virus 
vector; and Taylor et al, Vaccine (1995) 13:539 for a reference of a canary pox). Poxvirus 
vectors capable of expressing a polynucleotide of the invention can be obtained by homologous 
recombination as described in Kieny et aL, Nature (1984) 312:163 so that the polynucleotide of 
the invention is inserted in the viral genome under appropriate conditions for expression in 
mammalian cells. Generally, the dose of vaccine viral vector, for therapeutic or prophylactic use, 
can be of from about lxlO 4 to about lxlO 11 , advantageously from about lxl 0 7 to about lxlO 10 , 
preferably of from about lxlO 7 to about lxlO 9 plaque-forming units per kilogram. Preferably, 
viral vectors are administered parenterally; for example, in 3 doses, 4 weeks apart. Those skilled 
in the art recognize that it is preferable to avoid adding a chemical adjuvant to a composition 
containing a viral vector of the invention and thereby minimizing the immune response to the 
viral vector itself. 

Non-toxicogenic Vibrio cholerae mutant strains that are useful as a live oral vaccine 
are described in Mekalanos et aL, Nature (1983) 306:551 and U.S. Patent No. 4,882,278 (strain 
in which a substantial amount of the coding sequence of each of the two ctxA alleles has been 
deleted so that no functional cholerae toxin is produced); WO 92/1 1354 (strain in which the irgA 
locus is inactivated by mutation; this mutation can be combined in a single strain with ctxA 
mutations); and WO 94/1533 (deletion mutant lacking functional ctxA and attRSl DNA 
sequences). These strains can be genetically engineered to express heterologous antigens, as 
described in WO 94/19482. An effective vaccine dose of a Vibrio cholerae strain capable of 
expressing a polypeptide or polypeptide derivative encoded by a DNA molecule of the invention 
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can contain, e.g., about 1x10 s to about lxlO 9 , preferably about lxlO 6 to about 1x10* viable 
bacteria in an appropriate volume for the selected route of administration. Preferred routes of 
administration include all mucosal routes; most preferably, these vectors are administered 
intranasally or orally. 

5 Attenuated Salmonella typhimurium strains, genetically engineered for recombinant 

expression of heterologous antigens or not, and their use as oral vaccines are described in 
Nakayamaer aL (Bio/Technology (1988) 6:693) and WO 92/11361. Preferred routes of 
administration include all mucosal routes; most preferably, these vectors are administered 
intranasally or orally. 

10 Others bacterial strains useful as vaccine vectors are described in High et al. 7 EMBO 

(1992) 1 1:1991 and Sizemore et aL 9 Science (1995) 270:299 {Shigella jlexneri)\ Medaglini et a/., 
Proc. Natl. Acad. Sci. USA (1995) 92:6868 {Streptococcus gordonify and Flynn J.L., Cell. Mol. 
ui Biol. (1994) 40 (suppl. I):31, WO 88/6626, WO 90/0594, WO 91/13157, WO 92/1796, and WO 
92/2 1 376 (Bacille Calmette Guerin). 

In bacterial vectors, polynucleotide; of the invention can be inserted into the bacterial 
genome or can remain in a free state, carried on a plasmid. 

An adjuvant can also be added to a composition containing a vaccine bacterial vector. 
A number of adjuvants are known to those skilled in the art. Preferred adjuvants can be selected 
[U from the list provided below. 

j~T 20 According to a fourth aspect of the invention, there is also provided (i) a composition 

£ of matter containing a polynucleotide of the invention, together with a diluent or carrier; (ii) a 
pharmaceutical composition containing a therapeutically or prophylactically effective amount of 
a polynucleotide of the invention; (iii) a method for inducing an immune response against 
Chlamydia, in a mammal, by administering to the mammal, an immunogenically effective 
25 amount of a polynucleotide of the invention to elicit an immune response, e.g., a protective 
immune response to Chlamydia; and particularly, (iv) a method for preventing and/or treating a 
Chlamydia {e.g., C. trachomatis, C. psittaci, C. pneumoniae, or C. pecorum) infection, by 
administering a prophylactic or therapeutic amount of a polynucleotide of the invention to an 
individual in need. Additionally, the fourth aspect of the invention encompasses the use of a 
30 polynucleotide of the invention in the preparation of a medicament for preventing and/or treating 
Chlamydia infection. The fourth aspect of the invention preferably includes the use of a DNA 
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molecule placed under conditions for expression in a mammalian cell, e.g., in a plasmid that is 
unable to replicate in mammalian cells and to substantially integrate in a mammalian genome. 

Polynucleotides (DNA or RNA) of the invention can also be administered as such to a 
mammal for vaccine, e.g., therapeutic or prophylactic, purpose. When a DNA molecule of the 
invention is used, it can be in the form of a plasmid that is unable to replicate in a mammalian 
cell and unable to integrate in the mammalian genome. Typically, a DNA molecule is placed 
under the control of a promoter suitable for expression in a mammalian cell. The promoter can 
function ubiquitously or tissue-specifically. Examples of non-tissue specific promoters include 
the early Cytomegalovirus (CMV) promoter (described in U.S. Patent No. 4,168,062) and the 
Rous Sarcoma Virus promoter (described in Norton & Coffin, Molec. Cell Biol. (1985) 5:281). 
The desmin promoter (Li et al, Gene (1989) 78:243, Li & Paulin, J. Biol. Chem. (1991) 
266:6562 and Li & Paulin, J. Biol. Chem. (1993) 268:10403) is tissue-specific and drives 
expression in muscle cells. More generally, useful vectors are described, f.a., WO 94/21797 and 
Hartikka et a!., Human Gene Therapy (1996) 7:1205. 

For DNA/RNA vaccination, the polynucleotide of the invention can encode a 
precursor or a mature form. When it encodes a precursor form, the precursor form can be 
homologous or heterologous. In the latter case, a eucaryotic leader sequence can be used, such as 
the leader sequence of the tissue-type plasminogen factor (tPA). 

A composition of the invention can contain one or several polynucleotides of the 
invention. It can also contain at least one additional polynucleotide encoding another Chlamydia 
antigen such as urease subunit A, B, or both; or a fragment, derivative, mutant, or analog thereof. 
A polynucleotide encoding a cytokine, such as interleukin-2 (IL-2) or interleukin-12 (IL-12), can 
also be added to the composition so that the immune response is enhanced. These additional 
polynucleotides are placed under appropriate control for expression. Advantageously, DNA 
molecules of the invention and/or additional DNA molecules to be included in the same 
composition, can be carried in the same plasmid. 

Standard techniques of molecular biology for preparing and purifying polynucleotides 
can be used in the preparation of polynucleotide therapeutics of the invention. For use as a 
vaccine, a polynucleotide of the invention can be formulated according to various methods. 

First, a polynucleotide can be used in a naked form, free of any delivery vehicles, 
such as anionic liposomes, cationic lipids, microparticles, e.g., gold microparticles, precipitating 
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agents, e.g., calcium phosphate, or any other transfection-facilitating agent. In this case, the 
polynucleotide can be simply diluted in a physiologically acceptable solution, such as sterile 
saline or sterile buffered saline, with or without a carrier. When present, the carrier preferably is 
isotonic, hypotonic, or weakly hypertonic, and has a relatively low ionic strength, such as 
provided by a sucrose solution, e.g., a solution containing 20% sucrose. 

Alternatively, a polynucleotide can be associated with agents that assist in cellular 
uptake. It can be, i.a., (i) complemented with a chemical agent that modifies the cellular 
permeability, such as bupivacaine (see, e.g., WO 94/16737), (ii) encapsulated into liposomes, or 
(iii) associated with cationic lipids or silica, gold, or tungsten microparticles. 

Anionic and neutral liposomes are well-known in the art (see, e.g., Liposomes: A 
Practical Approach, RPC New Ed, IRL press (1990), for a detailed description of methods for 
making liposomes) and are useful for delivering a large range of products, including 
polynucleotides. 

Cationic lipids are also known in the art and are commonly used for gene delivery. 
Such lipids include Lipofectin™ also known as DOTMA (N-[l-(2,3-dioleyloxy)propyl]-N,N,N- 
trimethylammonium chloride), DOTAP (U-bis(oleyloxy)-3-(trimethylammonio)propane), 
DDAB (dimethyldioctadecylammonium bromide), DOGS (dioctadecylamidologlycyl spermine) 
and cholesterol derivatives such as DC-Choi (3 beta<N<^N'-dimethyl aminomethane)- 
carbamoyl) cholesterol). A description of these cationic lipids can be found in EP 187,702, 
WO 90/11092, U.S. Patent No. 5,283,185, WO 91/15501, WO 95/26356, and U.S. Patent 
No. 5,527,928. Cationic lipids for gene delivery are preferably used in association with a neutral 
lipid such as DOPE (dioleyl phosphatidylethanolamine), as, for example, described in 
WO 90/1 1092. 

Other transfection-facilitating compounds can be added to a formulation containing 
cationic liposomes. A number of them are described in, e.g., WO 93/18759, WO 93/19768, WO 
94/25608, and WO 95/2397. They include, i.a., spermine derivatives useful for facilitating the 
transport of DNA through the nuclear membrane (see, for example, WO 93/18759) and 
membrane-permeabilizing compounds such as GALA, Gramicidine S, and cationic bile salts 
(see, for example, WO 93/19768). 

Gold or tungsten microparticles can also be used for gene delivery, as described in 
WO 91/359, WO 93/17706, and Tangef a/. (Nature (1992) 356:152). In this case, the 
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microparticle-coated polynucleotides can be injected via intradermal or intraepidermal routes 
using a needleless injection device ("gene gun M ) 9 such as those described in U.S. Patent 
No. 4,945,050, U.S. Patent No. 5,015,580, and WO 94/24263. 

The amount of DNA to be used in a vaccine recipient depends, e.g., on the strength of 
the promoter used in the DNA construct, the immunogenicity of the expressed gene product, the 
condition of the mammal intended for administration (e.g., the weight, age, and general health of 
the mammal), the mode of administration, and the type of formulation- In general, a 
therapeutically or prophylactically effective dose from about 1 ug to about 1 mg, preferably, 
from about 10 ug to about 800 ug and, more preferably, from about 25 ug to about 250 ug, can 
be administered to human adults. The administration can be achieved in a single dose or 
repeated at intervals. 

The route of administration can be any conventional route used in the vaccine field. 
As general guidance, a polynucleotide of the invention can be administered via a mucosal 
surface, e.g., an ocular, intranasal, pulmonary, oral, intestinal, rectal, vaginal, and urinary tract 
surface; or via a parenteral route, e.g., by an intravenous, subcutaneous, intraperitoneal, 
intradermal, intraepidermal, or intramuscular route. The choice of the admimstration route will 
depend on, e.g., the formulation that is selected. A polynucleotide formulated in association with 
bupivacaine is advantageously administered into muscles. When a neutral or anionic liposome or 
a cationic lipid, such as DOTMA or DC-Choi, is used, the formulation can be advantageously 
injected via intravenous, intranasal (aerosolization), intramuscular, intradermal, and 
subcutaneous routes. A polynucleotide in a naked form can advantageously be administered via 
the intramuscular, intradermal, or sub-cutaneous routes. 

Although not absolutely required, such a composition can also contain an adjuvant. If 
so, a systemic adjuvant that does not require concomitant administration in order to exhibit an 
adjuvant effect is preferable such as, e.g., QS21, which is described in U.S. Patent No. 5,057,546. 

The sequence information provided in the present application enables the design of 
specific nucleotide probes and primers that can be useful in diagnosis. Accordingly, in a fifth 
aspect of the invention, there is provided a nucleotide probe or primer having a sequence found 
in or derived by degeneracy of the genetic code from a sequence shown in SEQ ID NO: 1 to 2. 

The term "probe" as used in the present application refers to DNA (preferably single 
stranded) or RNA molecules (or modifications or combinations thereof) that hybridize under the 
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stringent conditions, as defined above, to nucleic acid molecules having sequences homologous 
to those shown in SEQ ID NOs:l and 2, or to a complementary or anti-sense sequence. 
Generally, probes are significantly shorter than full-length sequences shown in SEQ ID NOs:l 
and 2; for example, they can contain from about 5 to about 100, preferably from about 10 to 
5 about 80 nucleotides. In particular, probes have sequences that are at least 75%, preferably at 
least 85%, more preferably 95% homologous to a portion of a sequence as shown in SEQ ID 
NOs:l and 2 or that are complementary to such sequences. Probes can contain modified bases 
such as inosine, methyl-5-deoxycytidine, deoxyuridine, dimethylamino-5-deoxyuridine, or 
diamino-2, 6-purine. Sugar or phosphate residues can also be modified or substituted. For 
10 example, a deoxyribose residue can be replaced by a polyamide (Nielsen et al, 7 Science (1991) 
254:1497) and phosphate residues can be replaced by ester groups such as diphosphate, alkyl, 
arylphosphonate and phosphorothioate esters. In addition, the 2-hydroxyl group on 
ribonucleotides can be modified by including, e.g., alkyl groups. 

Probes of the invention can be used in diagnostic tests, as capture or detection probes. 
1 5 Such capture probes can be conventionally immobilized on a solid support, directly or indirectly, 
jp by covalent means or by passive adsorption. A detection probe can be labelled by a detection 
03 marker selected from radioactive isotopes; enzymes such as peroxidase, alkaline phosphatase, 
Ll and enzymes able to hydrolyze a chromogenic, fluorogenic, or luminescent substrate; compounds 
that are chromogenic, fluorogenic, or luminescent; nucleotide base analogs; and biotin. 

1=1 

H* 20 Probes of the invention can be used in any conventional hybridization technique, such 

as dot blot (Maniatis et al., Molecular Cloning: A Laboratory Manual (1982) Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, New York), Southern blot (Southern, J. Mol. BioL (1975) 
98:503), northern blot (identical to Southern blot to the exception that RNA is used as a target), 
or the sandwich technique (Dunn et a!., Cell (1977) 12:23). The latter technique involves the use 
25 of a specific capture probe and/or a specific detection probe with nucleotide sequences that at 
least partially differ from each other. 

A primer is usually a probe of about 10 to about 40 nucleotides that is used to initiate 
enzymatic polymerization of DNA in an amplification process (e.g., PCR), in an elongation 
process, or in a reverse transcription method. In a diagnostic method involving PCR, primers 
30 can be labelled. 
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Thus, the invention also encompasses (i) a reagent containing a probe of the invention 
for detecting and/or identifying the presence of Chlamydia in a biological material; (ii) a method 
for detecting and/or identifying the presence of Chlamydia in a biological material, in which (a) a 
sample is recovered or derived from the biological material, (b) DNA or RNA is extracted from 
5 the material and denatured, and (c) exposed to a probe of the invention, for example, a capture, 
detection probe or both, under stringent hybridization conditions, such that hybridization is 
detected; and (iii) a method for detecting and/or identifying the presence of Chlamydia in a 
biological material, in which (a) a sample is recovered or derived from the biological material, 
(b) DNA is extracted therefrom, (c) the extracted DNA is primed with at least one, and preferably 
10 two, primers of the invention and amplified by polymerase chain reaction, and (d) the amplified 
DNA fragment is produced. 

As previously mentioned, polypeptides that can be produced upon expression of the 
newly identified open reading frames are useful vaccine agents. 
M* Therefore, a sixth aspect of the invention features a substantially purified polypeptide 

H 15 or polypeptide derivative having an amino acid sequence encoded by a polynucleotide of the 
invention. 

S A "substantially purified polypeptide" is defined as a polypeptide that is separated 

£ 

\=k from the environment in which it naturally occurs and/or that is free of the majority of the 

Hi 

*f? polypeptides that are present in the environment in which it was synthesized. For example, a 
*Z 20 substantially purified polypeptide is free from cytoplasmic polypeptides. Those skilled in the art 
S3 will understand that the polypeptides of the invention can be purified from a natural source, Le. , a 

Chlamydia strain, or can be produced by recombinant means. 

Homologous polypeptides or polypeptide derivatives encoded by polynucleotides of 
the invention can be screened for specific antigenicity by testing cross-reactivity with an 
25 antiserum raised against the polypeptide of reference having an amino acid sequence as shown in 
SEQ ID NOs:3 . Briefly, a monospecific hyperimmune antiserum can be raised against a 
purified reference polypeptide as such or as a fusion polypeptide, for example, an expression 
product of MBP, GST, or His-tag systems or a synthetic peptide predicted to be antigenic. The 
homologous polypeptide or derivative screened for specific antigenicity can be produced as such 
30 or as a fusion polypeptide. In this latter case and if the antiserum is also raised against a fusion 
polypeptide, two different fusion systems are employed. Specific antigenicity can be determined 
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according to a number of methods, including Western blot (Towbin et a!., Proc. Natl. Acad. Sci. 
USA (1979) 76:4350), dot blot, and ELISA, as described below. 

In a Western blot assay, the product to be screened, either as a purified preparation or 
a total E. coli extract, is submitted to SDS-Page electrophoresis as described by Laemmli (Nature 
(1970) 227:680). After transfer to a nitrocellulose membrane, the material is further incubated 
with the monospecific hyperimmune antiserum diluted in the range of dilutions from about 1 :5 to 
about 1:5000, preferably from about 1:100 to about 1:500. Specific antigenicity is shown once a 
band corresponding to the product exhibits reactivity at any of the dilutions in the above range. 

In an ELISA assay, the product to be screened is preferably used as the coating 
antigen. A purified preparation is preferred, although a whole cell extract can also be used. 
Briefly, about 100 ul of a preparation at about 10 ug protein/ml are distributed into wells of a 96- 
well polycarbonate ELISA plate. The plate is incubated for 2 hours at 37°C then overnight at 
4°C. The plate is washed with phosphate buffer saline (PBS) containing 0.05% Tween 20 
(PBS/Tween buffer). The wells are saturated with 250 ul PBS containing 1% bovine serum 
albumin (BSA) to prevent non-specific antibody binding. After 1 hour incubation at 37°C, the 
plate is washed with PBS/Tween buffer. The antiserum is serially diluted in PBS/Tween buffer 
containing 0.5% BSA. 100 |il of dilutions are added per well. The plate is incubated for 
90 minutes at 37°C, washed and evaluated according to standard procedures. For example, a 
goat anti-rabbit peroxidase conjugate is added to the wells when specific antibodies were raised 
in rabbits. Incubation is carried out for 90 minutes at 37°C and the plate is washed. The reaction 
is developed with the appropriate substrate and the reaction is measured by colorimetry 
(absorbance measured spectrophotometrically). Under the above experimental conditions, a 
positive reaction is shown by O.D. values greater than a non immune control serum. 

In a dot blot assay, a purified product is preferred, although a whole cell extract can 
also be used. Briefly, a solution of the product at about 100 ug/ml is serially two-fold diluted in 
50 mM Tris-HCl (pH 7.5). 100 ul of each dilution are applied to a nitrocellulose membrane 0.45 
[xm set in a 96-well dot blot apparatus (Biorad). The buffer is removed by applying vacuum to 
the system. Wells are washed by addition of 50 mM Tris-HCl (pH 7.5) and the membrane is air- 
dried. The membrane is saturated in blocking buffer (50 mM Tris-HCl (pH 7.5) 0.15 M NaCl, 
10 g/L skim milk) and incubated with an antiserum dilution from about 1:50 to about 1:5000, 
preferably about 1:500. The reaction is revealed according to standard procedures. For example, 
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a goat anti-rabbit peroxidase conjugate is added to the wells when rabbit antibodies are used. 
Incubation is carried out 90 minutes at 37°C and the blot is washed. The reaction is developed 
with the appropriate substrate and stopped. The reaction is measured visually by the appearance 
of a colored spot, e.g., by colorimetry. Under the above experimental conditions, a positive 
5 reaction is shown once a colored spot is associated with a dilution of at least about 1:5, 
preferably of at least about 1 :500. 

Therapeutic or prophylactic efficacy of a polypeptide or derivative of the invention 
can be evaluated as described below. 

According to a seventh aspect of the invention, there is provided (i) a composition of 
10 matter containing a polypeptide of the invention together with a diluent or carrier; in particular, 
(ii) a pharmaceutical composition containing a therapeutically or prophylactically effective 
amount of a polypeptide of the invention; (iii) a method for inducing an immune response against 
Of! Chlamydia in a mammal, by administering to the mammal an immunogenically effective amount 
p of a polypeptide of the invention to elicit an immune response, e.g., a protective immune 
tl 15 response to Chlamydia:, and particularly, (iv) a method for preventing and/or treating a 
«C Chlamydia (e.g., C. trachomatis. C psittaci, C. pneumoniae, or C. pecorum) infection, by 

r= : 

rg administering a prophylactic or therapeutic amount of a polypeptide of the invention to an 
r . individual in need. Additionally, the seventh aspect of the invention encompasses the use of a 

W polypeptide of the invention in the preparation of a medicament for preventing and/or treating 

Q 

M= 20 Chlamydia infection. 

The immunogenic compositions of the invention can be administered by any 
conventional route in use in the vaccine field, in particular to a mucosal (e.g., ocular, intranasal, 
pulmonary, oral, gastric, intestinal, rectal, vaginal, or urinary tract) surface or via the parenteral 
(e.g., subcutaneous, intradermal, intramuscular, intravenous, or intraperitoneal) route. The 
25 choice of the administration route depends upon a number of parameters, such as the adjuvant 
associated with the polypeptide. For example, if a mucosal adjuvant is used, the intranasal or 
oral route will be preferred and if a lipid formulation or an aluminum compound is used, the 
parenteral route will be preferred. In the latter case, the sub-cutaneous or intramuscular route is 
most preferred. The choice can also depend upon the nature of the vaccine agent. For example, 
30 a polypeptide of the invention fused to CTB or LTB will be best administered to a mucosal 
surface. 
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A composition of the invention can contain one or several polypeptides or derivatives 
of the invention. It can also contain at least one additional Chlamydia antigen, or a subunit, 
fragment, homolog, mutant, or derivative thereof. 

For use in a composition of the invention, a polypeptide or derivative thereof can be 
formulated into or with liposomes, preferably neutral or anionic liposomes, microspheres, 
ISCOMS, or virus-like-particles (VLPs) to facilitate delivery and/or enhance the immune 
response. These compounds are readily available to one skilled in the art; for example, see 
Liposomes: A Practical Approach {supra). 

Adjuvants other than liposomes and the like can also be used and are known in the 
art. A appropriate selection can conventionally be made by those skilled in the art, for example, 
from the list provided below. 

Administration can be achieved in a single dose or repeated as necessary at intervals 
as can be determined by one skilled in the art For example, a priming dose can be followed by 
three booster doses at weekly or monthly intervals. An appropriate dose depends on various 
parameters including the recipient (e.g., adult or infant), the particular vaccine antigen, the route 
and frequency of administration, the presence/absence or type of adjuvant, and the desired effect 
(e.g., protection and/or treatment), as can be determined by one skilled in the art. In general, a 
vaccine antigen of the invention can be administered by a mucosal route in an amount from about 
10 ng to about 500 mg, preferably from about 1 mg to about 200 mg. For the parenteral route of 
administration, the dose usually should not exceed about 1 mg, preferably about 100 ^g. 

When used as vaccine agents, polynucleotides and polypeptides of the invention can 
be used sequentially as part of a multistep immunization process. For example, a mammal can 
be initially primed with a vaccine vector of the invention such as a pox virus, e.g., via the 
parenteral route, and then boosted twice with the polypeptide encoded by the vaccine vector, e.g., 
via the mucosal route. In another example, liposomes associated with a polypeptide or derivative 
of the invention can also be used for priming, with boosting being carried out mucosally using a 
soluble polypeptide or derivative of the invention in combination with a mucosal adjuvant (e.g., 
LT). 

A polypeptide derivative of the invention is also useful as a diagnostic reagent for 
detecting the presence of a&ti-Chlamydia antibodies, e.g., in a blood sample. Such polypeptides 
are about 5 to about 80, preferably about 10 to about 50 amino acids in length and can be labeled 
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or unlabeled, depending upon the diagnostic method. Diagnostic methods involving such a 
reagent are described below. 

Upon expression of a DNA molecule of the invention, a polypeptide or polypeptide 
derivative is produced and can be purified using known laboratory techniques. For example, the 
polypeptide or polypeptide derivative can be produced as a fusion protein containing a fused tail 
that facilitates purification. The fusion product can be used to immunize a small mammal, e.g., a 
mouse or a rabbit, in order to raise antibodies against the polypeptide or polypeptide derivative 
(monospecific antibodies). The eighth aspect of the invention thus provides a monospecific 
antibody that binds to a polypeptide or polypeptide derivative of the invention. 

By "monospecific antibody" is meant an antibody that is capable of reacting with a 
unique natui^ly-occurring Chlamydia polypeptide. An antibody of the invention can be 
polyclonal or monoclonal. Monospecific antibodies can be recombinant, e.g., chimeric (e.g., 
constituted by a variable region of murine origin associated with a human constant region), 
humanized (a human immunoglobulin constant backbone together with hypervariable region of 
animal, e.g., murine, origin), and/or single chain. Both polyclonal and monospecific antibodies 
can also be in the form of immunoglobulin fragments, e.g., F(ab)'2 or Fab fragments. The 
antibodies of the invention can be of any isotype, e.g., IgG or IgA, and polyclonal antibodies can 
be of a single isotype or can contain a mixture of isotypes. 

The antibodies of the invention, which are raised to a polypeptide or polypeptide 
derivative of the invention, can be produced and identified using standard immunological assays, 
e.g., Western blot analysis, dot blot assay, or ELISA (see, e.g., Coligan et ah, Current Protocols 
in Immunology (1994) John Wiley & Sons, Inc., New York, NY). The antibodies can be used in 
diagnostic methods to detect the presence of a Chlamydia antigen in a sample, such as a 
biological sample. The antibodies can also be used in affinity chromatography methods for 
purifying a polypeptide or polypeptide derivative of the invention. As is discussed further 
below, such antibodies can be used in prophylactic and therapeutic passive immunization 
methods. 

Accordingly, a ninth aspect of the invention provides (i) a reagent for detecting the 
presence of Chlamydia in a biological sample that contains an antibody, polypeptide, or 
polypeptide derivative of the invention; and (ii) a diagnostic method for detecting the presence of 
Chlamydia in a biological sample, by contacting the biological sample with an antibody, a 
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polypeptide, or a polypeptide derivative of the invention, such that an immune complex is 
formed, and by detecting such complex to indicate the presence of Chlamydia in the sample or 
the organism from which the sample is derived. 

Those skilled in the art will understand that the immune complex is formed between a 
component of the sample and the antibody, polypeptide, or polypeptide derivative, whichever is 
used, and that any unbound material can be removed prior to detecting the complex. As can be 
easily understood, a polypeptide reagent is useful for detecting the presence of anti-Chlamydia 
antibodies in a sample, e.g., a blood sample, while an antibody of the invention can be used for 
screening a sample, such as a gastric extract or biopsy, for the presence of Chlamydia 
polypeptides. 

For use in diagnostic applications, the reagent (i.e., the antibody, polypeptide, or 
polypeptide derivative of the invention) can be in a free state or immobilized on a solid support, 
such as a tube, a bead, or any other conventional support used in the field. Immobilization can 
be achieved using direct or indirect means. Direct means include passive adsorption (non- 
covalent binding) or covalent binding between the support and the reagent. By "indirect means" 
is meant that an anti-reagent compound that interacts with a reagent is first attached to the solid 
support. For example, if a polypeptide reagent is used, an antibody that binds to it can serve as 
an anti-reagent, provided that it binds to an epitope that is not involved in the recognition of 
antibodies in biological samples. Indirect means can also employ a ligand-receptor system, for 
example, a molecule such as a vitamin can be grafted onto the polypeptide reagent and the 
corresponding receptor can be immobilized on the solid phase. This is illustrated by the biotin- 
streptavidin system. Alternatively, indirect means can be used, e.g., by adding to the reagent a 
peptide tail, chemically or by genetic engineering, and immobilizing the grafted or fused product 
by passive adsorption or covalent linkage of the peptide tail. 

According to a tenth aspect of the invention, there is provided a process for purifying, 
from a biological sample, a polypeptide or polypeptide derivative of the invention, which 
involves carrying out antibody-based affinity chromatography with the biological sample, 
wherein the antibody is a monospecific antibody of the invention. 

For use in a purification process of the invention, the antibody can be polyclonal or 
monospecific, and preferably is of the IgG type. Purified IgGs can be prepared from an 
antiserum using standard methods (see, e.g., Coligan et ah, supra). Conventional 
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chromatography supports, as well as standard methods for grafting antibodies, are disclosed in, 
e.g., Antibodies: A Laboratory Manual, D. Lane, E. Harlow, Eds. (1988). 

Briefly, a biological sample, such as an C pneumoniae extract, preferably in a buffer 
solution, is applied to a chromatography material, preferably equilibrated with the buffer used to 
dilute the biological sample so that the polypeptide or polypeptide derivative of the invention 
(i.e., the antigen) is allowed to adsorb onto the material. The chromatography material, such as a 
gel or a resin coupled to an antibody of the invention, can be in batch form or in a column. The 
unbound components are washed off and the antigen is then eluted with an appropriate elution 
buffer, such as a glycine buffer or a buffer containing a chaotropic agent, e.g., guanidine HC1, or 
high salt concentration (e.g., 3 M MgCl 2 ). Eluted fractions are recovered and the presence of the 
antigen is detected, e.g., by measuring the absorbance at 280 nm. 

An antibody of the invention can be screened for therapeutic efficacy as described as 
follows. According to an eleventh aspect of the invention, there is provided (i) a composition of 
matter containing a monospecific antibody of the invention, together with a diluent or carrier; (ii) 
a pharmaceutical composition containing a therapeutically or prophylactically effective amount 
of a monospecific antibody of the invention, and (iii) a method for treating or preventing a 
Chlamydia (e.g., C trachomatis, C. psittaci, C. pneumoniae or C. pecorum) infection, by 
administering a therapeutic or prophylactic amount of a monospecific antibody of the invention 
to an individual in need. Additionally, the eleventh aspect of the invention encompasses the use 
of a monospecific antibody of the invention in the preparation of a medicament for treating or 
preventing Chlamydia infection. 

To this end, the monospecific antibody can be polyclonal or monoclonal, preferably 
of the IgA isotype (predominantly). In passive immunization, the antibody can be administered 
to a mucosal surface of a mammal, e.g., the gastric mucosa, e.g., orally or intragastrically, 
advantageously, in the presence of a bicarbonate buffer. Alternatively, systemic administration, 
not requiring a bicarbonate buffer, can be carried out. A monospecific antibody of the invention 
can be administered as a single active component or as a mixture with at least one monospecific 
antibody specific for a different Chlamydia polypeptide. The amount of antibody and the 
particular regimen used can be readily determined by one skilled in the art. For example, daily 
administration of about 100 to 1,000 mg of antibodies over one week, or three doses per day of 
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about 100 to 1,000 mg of antibodies over two or three days, can be an effective regimens for 
most purposes. 

Therapeutic or prophylactic efficacy can be evaluated using standard methods in the 
art, e.g., by measuring induction of a mucosal immune response or induction of protective and/or 
5 therapeutic immunity, using, e.g., the C. pneumoniae mouse model . Those skilled in the art will 
recognize that the C. pneumoniae strain of the model can be replaced with another Chlamydia 
strain. For example, the efficacy of DNA molecules and polypeptides from C pneumoniae is 
preferably evaluated in a mouse model using an C. pneumoniae strain. Protection can be 
determined by comparing the degree of Chlamydia infection to that of a control group. 
10 Protection is shown when infection is reduced by comparison to the control group. Such an 
evaluation can be made for polynucleotides, vaccine vectors, polypeptides and derivatives 
thereof, as well as antibodies of the invention. 
01 Adjuvants useful in any of the vaccine compositions described above are as follows, 

p Adjuvants for parenteral administration include aluminum compounds, such as 

Nj 15 aluminum hydroxide, aluminum phosphate, and aluminum hydroxy phosphate. The antigen can 
«p be precipitated with, or adsorbed onto, the aluminum compound according to standard protocols, 
g Other adjuvants, such as RIBI (ImmunoChem, Hamilton, MT), can be used in parenteral 
f administration. 

Adjuvants for mucosal administration include bacterial toxins, e.g., the cholera toxin 
20 (CT), the E. coli heat-labile toxin (LT), the Clostridium difficile toxin A and the pertussis toxin 
(PT), or combinations, subunits, toxoids, or mutants thereof. For example, a purified preparation 
of native cholera toxin subunit B (CTB) can be of use. Fragments, homologs, derivatives, and 
fusions to any of these toxins are also suitable, provided that they retain adjuvant activity. 
Preferably, a mutant having reduced toxicity is used. Suitable mutants are described, e.g., in WO 
25 95/17211 (Arg-7-Lys CT mutant), WO 96/6627 (Arg-192-Gly LT mutant), and WO 95/34323 
(Arg-9-Lys and Glu-129-Gly PT mutant). Additional LT mutants that can be used in the 
methods and compositions of the invention include, e.g., Ser-63-Lys, Ala-69-Gly, Glu-110-Asp, 
and Glu-112-Asp mutants. Other adjuvants, such as a bacterial monophosphoryl lipid A 
(MPLA) of, e.g., E. coli Salmonella minnesota, Salmonella typhimurium, or Shigella flexneri; 
30 saponins, or polylactide glycolide (PLGA) microspheres, can also be used in mucosal 
administration. 
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Adjuvants useful for both mucosal and parenteral administrations include 
polyphosphazene (WO 95/2415), DC-chol (3 b-CN-CN'.N'-dimethyl aminomethane)-carbamoyl) 
cholesterol; U.S. Patent No. 5,283,185 and WO 96/14831) and QS-21 (WO 88/9336). 

Any pharmaceutical composition of the invention, containing a polynucleotide, a 
polypeptide, a polypeptide derivative, or an antibody of the invention, can be manufactured in a 
conventional manner. In particular, it can be formulated with a pharmaceutical^ acceptable 
diluent or carrier, e.g., water or a saline solution such as phosphate buffer saline. In general, a 
diluent or carrier can be selected on the basis of the mode and route of administration, and 
standard pharmaceutical practice. Suitable pharmaceutical carriers or diluents, as well as 
pharmaceutical necessities for their use in pharmaceutical formulations, are described in 
Remington's Pharmaceutical Sciences, a standard reference text in this field and in the USP/NF. 

The invention also includes methods in which Chlamydia infection, are treated by 
oral administration of a Chlamydia polypeptide of the invention and a mucosal adjuvant, in 
combination with an antibiotic, an antacid, sucralfate, or a combination thereof. Examples of 
such compounds that can be administered with the vaccine antigen and the adjuvant are 
antibiotics, including, e.g., macrolides, tetracyclines, and derivatives thereof (specific examples 
of antibiotics that can be used include azithromycin or doxicyclin or immunomodulators such as 
cytokines or steroids. In addition, compounds containing more than one of the above-listed 
components coupled together, can be used. The invention also includes compositions for 
carrying out these methods, ie. y compositions containing a Chlamydia antigen (or antigens) of 
the invention, an adjuvant, and one or more of the above-listed compounds, in a 
pharmaceutically acceptable carrier or diluent. 

Amounts of the above-listed compounds used in the methods and compositions of the 
invention can readily be determined by one skilled in the art. In addition, one skilled in the art 
can readily design treatment/immunization schedules. For example, the non-vaccine components 
can be administered on days 1-14, and the vaccine antigen + adjuvant can be administered on 
days 7, 14, 21, and 28. 
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Figure 1 

gtacgaagtt cttcacgaaa ttataataca aacctaggct ctaagttttg tttctagatt 60 

atcgaaaacg tgttaattaa ttgaacccaa gcatctttct atg aaa ata ccc ttg 115 

Met Lys lie Pro Leu 
1 5 

cac aaa etc ctg ate tct teg act ctt gtc act ccc att eta ttg age 163 

His Lys Leu Leu lie Ser Ser Thr Leu Val Thr Pro lie Leu Leu Ser 
10 15 20 

att gca act tac gga gca gat get tct tta tec cct aca gat age ttt 211 
lie Ala Thr Tyr Gly Ala Asp Ala Ser Leu Ser Pro Thr Asp Ser Phe 
25 30 35 

gat gga gcg ggc ggc tct aca ttt act cca aaa tct aca gca gat gec 259 
Asp Gly Ala Gly Gly Ser Thr Phe Thr Pro Lys Ser Thr Ala Asp Ala 
40 45 50 

aat gga acg aac tat gtc tta tea gga aat gtc tat ata aac gat get 307 
Asn Gly Thr Asn Tyr Val Leu Ser Gly Asn Val Tyr He Asn Asp Ala 
55 60 65 

ggg aaa ggc aca gca tta aca ggc tgc tgc ttt aca gaa act acg ggt 355 

Q Gly Lys Gly Thr Ala Leu Thr Gly Cys Cys Phe Thr Glu Thr Thr Gly 

H> 70 75 80 85 

M= 

f=h gat ctg aca ttt act gga aag gga tac tea ttt tea ttc aac acg gta 403 

l2 Asp Leu Thr Phe Thr Gly Lys Gly Tyr Ser Phe Ser Phe Asn Thr Val 

4* 90 95 100 

fU 

ffl gat gcg ggt teg aat gca gga get gcg gca age aca act get gat aaa 4 51 

Asp Ala Gly Ser Asn Ala Gly Ala Ala Ala Ser Thr Thr Ala Asp Lys 
^ 105 110 115 

'J? gee eta ate ttc aca gga ttt tct aac ctt tec ttc att gca get cct 499 

U? Ala Leu He Phe Thr Gly Phe Ser Asn Leu Ser Phe He Ala Ala Pro 

Mr 120 125 130 



» • i 



gga act aca gtt get tea gga aaa agt act tta agt tct gca gga gee 547 
Gly Thr Thr Val Ala Ser Gly Lys Ser Thr Leu Ser Ser Ala Gly Ala 
135 140 145 

tta aat ctt acc gat aat gga acg att etc ttt age caa aac gtc tec 595 
Leu Asn Leu Thr Asp Asn Gly Thr lie Leu Phe Ser Gin Asn Val Ser 
150 155 160 165 

aat gaa get aat aac aat ggc gga gcg ate acc aca aaa act ctt tct 64 3 
Asn Glu Ala Asn Asn Asn Gly Gly Ala He Thr Thr Lys Thr Leu Ser 
170 175 180 

att tct ggg aat acc tct tct ata acc ttc act agt aat age gca aaa 691 
He Ser Gly Asn Thr Ser Ser He Thr Phe Thr Ser Asn Ser Ala Lys 
185 190 195 

aaa tta ggt gga gcg ate tat age tct gcg get gca agt att tea gga 739 
Lys Leu Gly Gly Ala He Tyr Ser Ser Ala Ala Ala Ser He Ser Gly 
200 205 210 

aac acc ggc cag tta gtc ttt atg aat aat aaa gga gaa act ggg ggt 7 87 
Asn Thr Gly Gin Leu Val Phe Met Asn Asn Lys Gly Glu Thr Gly Gly 
215 220 225 
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ggg get ctg ggc ttt gaa gec age tec teg att act caa aat age tec 835 
Gly Ala Leu Gly Phe Glu Ala Ser Ser Ser He Thr Gin Asn Ser Ser 
230 235 240 245 

ctt ttc ttc tct gga aac act gca aca gat get gca ggc aag ggc ggg 883 
Leu Phe Phe Ser Gly Asn Thr Ala Thr Asp Ala Ala Gly Lys Gly Gly 
250 255 260 

gee att tat tgt gaa aaa aca gga gag act cct act ctt act ate tct 931 
Ala He Tyr Cys Glu Lys Thr Gly Glu Thr Pro Thr Leu Thr He Ser 
265 270 275 

gga aat aaa agt ctg ace ttc gee gag aac tct tea gta act caa ggc 97 9 
Gly Asn Lys Ser Leu Thr Phe Ala Glu Asn Ser Ser Val Thr Gin Gly 
280 285 290 

gga gca ate tgt gec cat ggt eta gat ctt tec get get ggc cct ace 1027 
Gly Ala lie Cys Ala His Gly Leu Asp Leu Ser Ala Ala Gly Pro Thr 
295 300 305 

eta ttt tea aat aat aga tgc ggg aac aca get gca ggc aag ggc ggc 1075 
Leu Phe Ser Asn Asn Arg Cys Gly Asn Thr Ala Ala Gly Lys Gly Gly 
310 315 320 325 

m get att gca att gee gac tct gga tct tta agt etc tct gca aat caa 1123 

^ Ala lie Ala lie Ala Asp Ser Gly Ser Leu Ser Leu Ser Ala Asn Gin 

71 330 335 340 

H* gga gac ate acg ttc ctt ggc aac act eta ace tea ace tec gcg cca 1171 

Q Gly Asp He Thr Phe Leu Gly Asn Thr Leu Thr Ser Thr Ser Ala Pro 

^ 345 350 355 

nj 

Lj s aca teg aca egg aat get ate tac ctg gga teg tea gca aaa att acg 1219 

^ Thr Ser Thr Arg Asn Ala He Tyr Leu Gly Ser Ser Ala Lys He Thr 
« 360 365 370 

ry aac tta agg gca gee caa ggc caa tct ate tat ttc tat gat ccg att 1267 

^ Asn Leu Arg Ala Ala Gin Gly Gin Ser He Tyr Phe Tyr Asp Pro He 
rf 375 380 385 

y3 gca tct aac ace aca gga get tea gac gtt ctg ace ate aac caa ccg 1315 

rQ Ala Ser Asn Thr Thr Gly Ala Ser Asp Val Leu Thr He Asn Gin Pro 
390 395 400 405 

gat age aac teg cct tta gat tat tea gga acg att gta ttt tct ggg 1363 
Asp Ser Asn Ser Pro Leu Asp Tyr Ser Gly Thr He Val Phe Ser Gly 
410 415 420 

gaa aag etc tct gca gat gaa gcg aaa get get gat aac ttc aca tct 1411 
Glu Lys Leu Ser Ala Asp Glu Ala Lys Ala Ala Asp Asn Phe Thr Ser 
425 430 435 

ata tta aag caa cca ttg get eta gee tct gga ace tta gca etc aaa 1459 
He Leu Lys Gin Pro Leu Ala Leu Ala Ser Gly Thr Leu Ala Leu Lys 
440 445 450 

gga aat gtc gag tta gat gtc aat ggt ttc aca cag act gaa ggc tct 1507 
Gly Asn Val Glu Leu Asp Val Asn Gly Phe Thr Gin Thr Glu Gly Ser 
455 460 465 

aca etc etc atg caa cca gga aca aag etc aaa gca gat act gaa get 1555 
Thr Leu Leu Met Gin Pro Gly Thr Lys Leu Lys Ala Asp Thr Glu Ala 
470 475 480 485 
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ate agt ctt acc aaa ctt gtc gtt gat ctt tct gec tta gag gga aat 1603 
He Ser Leu Thr Lys Leu Val Val Asp Leu Ser Ala Leu Glu Gly Asn 
490 495 500 

aag agt gtg tec att gaa aca gca gga gec aac aaa act ata act eta 1651 
Lys Ser Val Ser He Glu Thr Ala Gly Ala Asn Lys Thr lie Thr Leu 
505 510 515 

acc tct cct ctt gtt ttc caa gat agt age ggc aat ttt tat gaa age 1699 
Thr Ser Pro Leu Val Phe Gin Asp Ser Ser Gly Asn Phe Tyr Glu Ser 
520 525 530 

cat acg ata aac caa gec ttc acg cag cct ttg gtg gta ttc act get 1747 
His Thr He Asn Gin Ala Phe Thr Gin Pro Leu Val Val Phe Thr Ala 
535 540 545 

get act get get age gat att tat ate gat gcg ctt etc act tct cca 17 95 
Ala Thr Ala Ala Ser Asp He Tyr He Asp Ala Leu Leu Thr Ser Pro 
550 555 560 565 

gta caa act cca gaa cct cat tac ggg tat cag gga cat tgg gaa gee 1843 
Val Gin Thr Pro Glu Pro His Tyr Gly Tyr Gin Gly His Trp Glu Ala 
570 575 580 

EP act tgg gca gac aca tea act gca aaa tea gga act atg act tgg gta 18 91 

B Thr Trp Ala Asp Thr Ser Thr Ala Lys Ser Gly Thr Met Thr Trp Val 

U 585 590 595 



□ 



act acg ggc tac aac cct aat cct gag cgt aga get tec gta gtt ccc 1939 

Thr Thr Gly Tyr Asn Pro Asn Pro Glu Arg Arg Ala Ser Val Val Pro 
600 605 610 

gat tea tta tgg gca tec ttt act gac att cgc act eta cag cag ate 1987 

Asp Ser Leu Trp Ala Ser Phe Thr Asp He Arg Thr Leu Gin Gin He 
615 620 625 



atg aca tct caa gcg aat agt ate tat cag caa cga gga etc tgg gca 2035 
p Met Thr Ser Gin Ala Asn Ser lie Tyr Gin Gin Arg Gly Leu Trp Ala 

630 635 640 645 

]rf tea gga act gcg aat ttc ttc cat aag gat aaa tea gga act aac caa 2083 

Ser Gly Thr Ala Asn Phe Phe His Lys Asp Lys Ser Gly Thr Asn Gin 
650 655 660 

gca ttc cga cat aaa age tac ggc tat att gtt gga gga agt get gaa 2131 
Ala Phe Arg His Lys Ser Tyr Gly Tyr He Val Gly Gly Ser Ala Glu 
665 670 675 

gat ttt tct gaa aat ate ttc agt gta get ttc tgc cag etc ttc ggt 2179 
Asp Phe Ser Glu Asn He Phe Ser Val Ala Phe Cys Gin Leu Phe Gly 
680 685 690 

aaa gat aaa gac ctg ttt ata gtt gaa aat acc tct cat aac tat tta 2227 
Lys Asp Lys Asp Leu Phe He Val Glu Asn Thr Ser His Asn Tyr Leu 
695 700 705 

gcg teg eta tac ctg caa cat cga gca ttc eta gga gga ctt ccc atg 2275 
Ala Ser Leu Tyr Leu Gin His Arg Ala Phe Leu Gly Gly Leu Pro Met 
710 715 720 725 

ccc tea ttt gga agt ate acc gac atg ctg aaa gat att cct etc att 2323 
Pro Ser Phe Gly Ser He Thr Asp Met Leu Lys Asp He Pro Leu He 
730 735 740 
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ttg aat gcc cag eta age tac age tac act aaa aat gat atg gat act 
Leu Asn Ala Gin Leu Ser Tyr Ser Tyr Thr Lys Asn Asp Met Asp Thr 
745 750 755 



2371 



cgc tat act tec tat cct gaa get caa ggc tct tgg acc aat aac tct 2419 
Arg Tyr Thr Ser Tyr Pro Glu Ala Gin Gly Ser Trp Thr Asn Asn Ser 
760 765 770 

ggg get eta gag etc gga gga tct ctg get eta tat etc cct aaa gaa 24 67 
Gly Ala Leu Glu Leu Gly Gly Ser Leu Ala Leu Tyr Leu Pro Lys Glu 
775 780 785 

gca ccg ttc ttc cag gga tat ttc ccc ttc tta aag ttc cag gca gtc 2515 
Ala Pro Phe Phe Gin Gly Tyr Phe Pro Phe Leu Lys Phe Gin Ala Val 
790 795 800 805 

tac age cgc caa caa aac ttt aaa gag agt ggc get gaa gcc cgt get 2563 
Tyr Ser Arg Gin Gin Asn Phe Lys Glu Ser Gly Ala Glu Ala Arg Ala 
810 815 820 

ttt gat gat gga gac eta gtg aac tgc tct ate cct gtc ggc att egg 2611 
Phe Asp Asp Gly Asp Leu Val Asn Cys Ser He Pro Val Gly He Arg 
825 830 835 

tta gaa aaa ate tec gaa gat gaa aaa aat aat ttc gag att tct eta 2659 
Leu Glu Lys He Ser Glu Asp Glu Lys Asn Asn Phe Glu He Ser Leu 
840 845 850 

gcc tac att ggt gat gtg tat cgt aaa aat ccc cgt teg cgt act tct 2707 
Ala Tyr He Gly Asp Val Tyr Arg Lys Asn Pro Arg Ser Arg Thr Ser 
4= 855 860 865 

- u 

rfi cta at 9 gtc agt gga gcc tct tgg act teg eta tgt aaa aac etc gca 2755 

— Leu Met Val Ser Gly Ala Ser Trp Thr Ser Leu Cys Lys Asn Leu Ala 
f. 870 875 880 885 

pi 

FU c 5 a caa gcc ttc tta gca agt get gga age cat ctg act etc tec cct 2803 

p Arg Gin Ala Phe Leu Ala Ser Ala Gly Ser His Leu Thr Leu Ser Pro 

890 895 900 

•ft 

Jj« cat gta gaa etc tct ggg gaa get get tat gag ctt cgt ggc tea gca 2851 

ty His Val Glu Leu Ser Gly Glu Ala Ala Tyr Glu Leu Arg Gly Ser Ala 

905 910 915 

cac ate tac aat gta gat tgt ggg cta aga tac tea ttc tagttcctac 2900 
His He Tyr Asn Val Asp Cys Gly Leu Arg Tyr Ser Phe 
920 925 930 

tttcctccct aaacttttag ggaggaattc ttataaaaac cctgtagatt cttaacttac 2960 
tagtctctcc tttcctcttg ctttctttaa tttattgcag 3000 
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Figure 2 

Restriction enzyme analysis of CPN100639 



TspSOSI 

MboII Hpyl78III | 
Rsal Xmnl | j 

I III 



Ddel 
CviJI 
Bfal 
Avrll| 
BsaJI | 
styl| 
II 



Hpyl78III 
Bfal | 
Xbal | | 
II 



GTACGAAGTTCTTCACGA^TTATAATAC^^CCTAGGCTCTAAGTTTTGTTTCTAGATT 



- + 60 



CATGCTTCAAGAAGTGCTTTAATATTATGTTTGGATCCGAGATTCAAAACAAAGATCTAA 



Tsp509I 
Msel 
Pad 
Vspl 
Tsp509I | 
AflHI | | 

Taql Maell Msel j j 

I I II I 



Tthlllll 
SfaNI | 

I I 



CViRI 



61 



AT CG AAAACG T GTT AAT TAATTGAAC C CAAGCAT C T TTCTATG AAAAT AC C CTTG CAC AA 
+ + + + + + 

TAGCTTTTGCACAATTAATTAACTTGGGTTCGTAGAAAGATACTTTTATGGGAACGTGTT 



120 



Plel 
Dpnl 
Sau3AI | 
Hpyl78IIl| j 
MboII | j | 



Maelll 
Tsp45I 
PshAI 
Earl | 
Hinfl j 
Taql | | 



CviRI 
BsrDI ISfaNI 



ACTCCTGATCTCTTCGACTCTTGTCACTCCCATTCTATTGAGCATTGCAACTTACGGAGC 



121 



-+ 180 



TGAGGACTAGAGAAGCTGAGAACAGTGAGGGTAAGATAACTCGTAACGTTGAATGCCTCG 



CviJI 
Fnu4HI 
Taul 
Acil 
Cac8I 
Acil 
BsrBI 
Bed 
Faul 
Sthl32I | 
Alul | j 
CviJI j | 



Sfcl 



BsaXI 



I 



AGATGCTTCTTTATCCCCTACAGATAGCTTTGATGGAGCGGGCGGCTCTACATTTACTCC 
181 + + + + + + 240 

TCTACGAAGAAATAGGGGATGTCTATCGAAACTACCTCGCCCGCCGAGATGTAAATGAGG 
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Sfcl 
SfaNI I 



Hpyl78III 



SfaNI 



AAAATCTACAGC^GATGCC^TGGAACGAACTATGTCTTATCAGGAAATGTCTATA.TAAA 

241 + + + + + + 300 

TTTTAGATGTCGTCTACGGTTACCTTGCTTGATACAGAATAGTCCTTTACAGATATATTT 



Msel 
Bbvl | 
Bbvl I 



Fnu4HI 
Tsel 
Fnu4HI | 
CviJI | | 
Tsel | | 



Sthl32I 



Hpyl8 8IX 
Dpnl 
Sau3AI 
BscGI 
Bsp24I 
Cjel 
CjePI 
I 



CGATGCTGGGAAAGGCACAGCATTAACAGGCTGCTGCTTTACAGAAACTACGGGTGATCT 

301 + + -+ + + + 360 

GCTACGACCCTTTCCGTGTCGTAATTGTCCGACGACGAAATGTCTTTGATGCCCACTAGA 



HphI 



BsrI 
BciVI | 



Cjel 
CjePl| 
Bsp24l| j 



Faul 
Taal 
Sthl32l| 
Bsbl | j 
SfaNI | j j 
I I II 



NspV 
TaqI 
Bbvl | 
Acelllj j 
Acil f | | 
I III 



CviRI 



GACATTT ACTGGAAAG GG ATACTCATTTT C ATT CAACACGGTAGATG CGGGTTCG AATGC 



361 



-+ 420 



CTGTAAATGACCTTTCCCTATGAGTAAAAGTAAGTTGTGCCATCTACGCCCAAGCTTACG 



Cac8I 
Fnu4HI 
Taul 
Acil 
Fnu4HI 
Alul| 
AlwNlj 
BstAPl| 

CviJI III I CviJT 
Mwolj j j j Mboll) 
Tsel j j j j Mwol | j 

BsmI | | | | I Tthlllll| j | 

II II 

AGGAGCTGCGGCAAGCACAACTGCTGATAAAGCCCTAATCTTCACAGGATTTTCTAACCT 

421 + + + + + + 480 

TCCTCGACGCCGTTCGTGTTGACGACTATTTCGGGATTAGAAGTGTCCTAAAAGATTGGA 



Eco57I 
ScrFI 
AlwNI 
EcoRII 
Alul 
CviJI 
Fnu4HI | 
CviRI | | 
BsrDI Tsel j | 
I II I 



Acelll 

Sfcl Hpyl7flIII 
Bbvl | Taal ) 



Rsal 
Seal 
TatI I Msel 



CviRI 
Sfcl I 
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TTCCTTCATTGCAGCTCCTGGAACTACAGTTGCTTCAGGAAAAAGTACTTTAAGTTCTGC 

481 + - — + + + + + 540 

AAGGAAGTAACGTCGAGGACCTTGATGTCAACGAAGTCCTTTTTCATGAAATTCAAGACG 



Ms el 
Cjel 
CviJI | 
NlalV | | 
PstI | | | 
I II I 



CjePI 
Hinfl | 
Tfil j 



CviJI 

Cjel | Maell 



BsmAI 
BsmBI 



AG GAG C CTTAAAT CTT AC C GATAATG GAACGATT CT CTTT AG C CAAAAC GTCT C CAATG A 
541 + + + + + + 600 

TC CTCGGAATTTAG AATGG CTATTACCTTGCTAAGAGAAATCGGTTTTGC AGAGGTTACT 



Alul 
CviJI CjePI 



Dpnl 
Sau3AI 
CjePI 
Ecil | 
Hphl j 
Acil | j 
II I 



CjePI 
MboII | 
II 

AGCTAATAACAATGGCGGAGCGATCACCACAAAAACTCTTTCTATTTCTGGGAATACCTC 

601 + + + + + + 660 

TCGATTATTGTTACCGCCTCGCTAGTGGTGTTTTTGAGAAAGATAAAGACCCTTATGGAG 



Bbvl 

Sfcl Fnu4HI 

Mnll Bfal Tsp509I Dpnl | Alul Taul 

Earl | Spel| Hhal | Sau3AI I CviJI Acill 

II M II I II I II 

TTCTATAACCTTCACTAGTAATAGCGCAAAAAAATTAGGTGGAGCGATCTATAGCTCTGC 



661 



-+ 720 



AAGATATTGGAAGTGATCATTATCGCGTTTTTTTAATCCACCTCGCTAGATATCGAGACG 



CviRI 
Fnu4HI | 
CviJI | | 
Tsel | | 
II I 



Hpyl78III 



BsrI 
CviJI 
Haelll 
Eael 
Gdill 
Mspl| 
BsrFI | j 
III 



Cjel 



GGCTGCAAGTATTTCAGGAAACACCGGCCAGTTAGTCTTTATGAATAATAAAGGAGAAAC 
721 + + + + + + 780 

CCGACGTTCATAAAGTCCTTTGTGGCCGGTCAATCAGAAATACTTATTATTTCCTCTTTG 



Alul 
CviJI 
CacSI 
CviJI 

Banll Cjel | 

BsrI Bspl286I BseRI | j 
Bmrl | CviJI | CviJI | j j 
II II I II I 



Mnll 
Acelll | 
Taqi | | 



MboII 
Alul | 
CviJI j 
II 



TGGGGGTGGGGCTCTGGGCTTTGAAGCCAGCTCCTCGATTACTCAAAATAGCTCCCTTTT 



781 



-+ 840 



38 
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ACCCCCACCCCGAGACCCGAAACTTCGGTCGAGGAGCTAATGAGTTTTATCGAGGGAAAA 



Faul 
Cac8I 
PstI 
Sthl32I 
CviRI 
Fnu4HI 
Sfcl 
AlwNI 



Tsel 
BstAPI 
TspRI | 
CviRI | j 
BtsI | | | 
Bbvl Ml I 
Hpyl78III SfaNI | | |MwoI 
I I I I I I 



CviJI 
Haelll 
NlalV 
Sau96I | 
Acil | j 
Cjel | | | 
I 



C TTCT CTGG AAACACTGC AAC AG ATG CTG C AGG CAAGGGCGG GG C CATTT ATTGTGAAAA 



841 



-+ 900 



GAAGAGACCTTTGTGACGTTGTCTACGACGTCCGTTCCCGCCCCGGTAAATAACACTTTT 



Hin4I 

Cjel | ECOS7I 
Plel Cjel |j Cjel| Bce83I 

BsmAI | Hinf I j j Hpyl78III Hpyl88IX | j MboII | 

I I I M I I II I I 

AACAGGAGAGACTCCTACTCTTACTATCTCTGGAAATAAAAGTCTGACCTTCGCCGAGAA 

90 1 + + + + + + gee 

TTGT C C TCTCTG AG GATGAG AATG ATAG AGAC C TTTATTTT C AG ACTGGAAG CGGC TCTT 



Dpnl 

Bglll | 

BstYI j 

Sau3AI j 

Hpyl78IIl| | 

Bf al | | | 

Bbvl III j 

Xbal j)! j 

Nlalll | | | | | 
BsaJI I 



Bspl286I | | | | I I CviJI 

BstDSI | j j j | | Haelll 

Ncol j j | j j j Sau96l| 

Styl j j | j j j Cac8I | j 

Bmgl | j j j j j j Fnu4HI | | j 

BseSI j I j I j I j MspAlI | III 

Earl Ecil BstAPI | j I I I I j I Tsel j | | | 

Maelll Smll Acil| Mwol j j I III I I Acil || ||| 



I I II III I INI I III III 

CTCTTCAGTAACTCAAGGCGGAGCAATCTGTCCCC^TGGTCTAGATCTTTCCGCTGCTGG 
+ + + + + + 1020 

GAGAAGTCATTGAGTTCCGCCTCGTTAGACACGGGTACCAGATCTAGAAAGGCGACGACC 

CacSI 
PstI 

39 



Fan I 
Sthl32l| 
Sf aNI | | 

I II 



Acil 
Bbvl I 



CviRI 
Fnu4HI 
Sfcl 
Alul| 
CviJI j 
MspAllj 
Pvullj 
Tselj 
II 
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Fnu4HI 
Taul 
Acil|HaeII 
Mwol | Hhal | 

II 



CCCTACCCTATTTTCAAATAATAGATGCGGGAACACAGCTGCAGGCAAGGGCGGCGCTAT 



1021 



-+ 1080 



GGGATGGGATAAAAGTTTATTATCTACGCCCTTGTGTCGACGTCCGTTCCCGCCGCGATA 



Plel 
Mtinl | 
Tsp509I j 
CviRI | | 
II I 



Dpnl 
BstYI 
Sau3AI 
Hpyl78III 
HaeIV| 
Hin4l| 
Hinf I I 



BsmAI 

Alwl CviRI | BsaJI 

Msel | BsmAI | |CjeI Maell Styl 

M I I i i ii 

TGCAATTGCCGACTCTGGATCTTTAAGTCTCTCTGCAAATCAAGGAGACATCACGTTCCT 

1081 + + + + + + H40 

ACG TTAACGGCTGAGACCTAGAAATTCAGAGAGACGTTTAG TTCCTCTGTAGTGCAAGGA 



m 



Bsbl 



Cjel 



Mnll 
Hhal | 
Thai | | 
Mnll | j | 
Acil | j j |TaqI 
III I I I 



Sau3AI 
ScrFI 
BsaJI | 
EcoRIlj | 
BsmI | | j 

II 



TGGCAACACTCTAACCTCAACCTCCGCGCCAACATCGACACGGAATGCTATCTACCTGGG 



1141 



-+ 1200 



Dpnl 



ACCGTTGTGAGATTGGAGTTGGAGGCGCGGTTGTAGCTGTGCCTTACGATAGATGGACCC 



Bbvl 
CviJI 
Hael 
Haelll 
BsaJI 
Styl 
CviJI 
Fnu4HI 
Tsel 
CjePl 
Msel | 
Aflll) j 
Smll j | 

II I 



Tsp509I 
Alwl | 
I I 



Dpnl 
CjePI| 
Sau3AI | j 
Alwl | | | 

III 



ATCGTCAGCAAAAATTACGAACTTAAGGGCAGCCCAAGGCCAATCTATCTATTTCTATGA 



1201 



-+ 1260 



TAGCAGTCGTTTTTAATGCTTGAATTCCCGTCGGGTTCCGGTTAGATAGATAAAGATACT 



Hpyl88IX 
Maell | 
Hpyl88IX | j 
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Eco57I Alul | | | Mspl 

Hpyl88IX CviRI |SfaNI CviJI | j | Bed BsaWI | Hin4I 

I I I I I I I I I II I 

TCCGATTGCATCTAACACCACAGGAGCTTCAGACGTTCTGACCATCAACCAACCGGATAG 

1261 + + + + + + 1320 

AGGCTAACGTAGATTGTGGTGTCCTCGAAGTCTGCAAGACTGGTAGTTGGTTGGCCTATC 

Bbvl 
PstI 

Alul CviRI | 

Hpyl78III CviJI Sfcl | j 

I I I I I 

CAACTCGCCTTTAGATTATTCAGGAACGATTGTATTTTCTGGGGAAAAGCTCTCTGCAGA 

1321 + + + + + + 1380 

GTTGAGCGGAAAT CTAATAAGTCCTTGCTAACATAAAAGACCCCTTTTCGAGAGAC GTCT 

Fnu4HI 
Alul | 

CviJI j CviJI 
Mwol j Bf al | 

Mwol Tsel j Msel CviJI | j 

I II I III 

01 TGAAGCGAAAGCTGCTGATAACTTCACATCTATATTAAAGCAACCATTGGCTCTAGCCTC 

p 1381 + + + + + + 1440 

1^ ACTTCGCTTTCGACGACTATTGAAGTGTAGATATAATTTCGTTGGTAACCGAGATCGGAG 

^ BpulOI 
*P Ddel 
3Tj Mrxll 
NlalV | 

fe Hpyl78III | j TaqI 

r, III I 

^ TGGAACCTTAGCACTCAAAGGAAATGTCGAGTTAGATGTCAATC 

^ 1441 + + + + + + 1500 

G3 ACCTTGGAATCGTGAGTTTCCTTTACAGCTCAATCTACAGTTACCAAAGTGTGTCrrGACT 



ScrPI 
Mnlll 
EcoRIl| 
CviRI | 

CviJI Nlalll | 

BseRl| Eco57l| j 

II II I 

AGGCTCTACACTCCTCATGCAACCAGGAACAAAGCTCAAAGCAGATACTGAAGCTATCAG 

1501 + + + + + + 1560 

TCCGAGATGTGAGGAGTACGTTGGTCCTTGTTTCGAGTTTCGTCTATGACTTCGATAGTC 



Alul Alul 
CviJI AlwNI CviJI 



Dpnl Ddel 
Eco57I Sau3AI | Mnll | 

I II II 

TCrTACCAAACTTGTCGTTGATCTTTCTGCCTTAGAGGGAAATAAGAGTGTGTCCATTGA 

1561 + + + + + + 1620 

AGAATGGTTTGAACAGCAACTAG AAAG ACGG AAT CT CC CTTTAT T CTCACAC AGGT AAC T 



CviJI 
NlalVj 



BseRI 

41 
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tfl 



Bpml Rsal 
Clal | TatI | 

SfaNI TaqI |HhaI Bpml BsrI | j 



II I l!| 

AACAGCAGGAGCCAACAAAACTATAACTCTAACCTCTCCTCTTGTTTTCCAAGATAGTAG 

1621 + + + + + + 1680 

TTGTCGTCCTCGGTTGTTTTGATATTGAGATTGGAGAGGAGAACAAAAGGTTCTATCATC 

Tsp509I CviJI 
Fnu4HI | Fmi4HI | Bbvl 

Taul j CviJI CviJI Tsel | | Bbvl | 

II l l II I I I 

CGGCAATTTTTATGAAAG CCAT ACGATAAAC CAAGCCTTCACGCAGCCTTTGGTGG TATT 

1681 + + + + + + 1740 

GCCGTTAAAAATACTTTCGGTATGCTATTTGGTTCGGAAGTGCGTCGGAAACCACCATAA 

Cac8I 
Bfal | 
TspRI Nhel | 

Fnu4HI | Fnu4HI 
Tsel | j Mwol | 
Btsl | | j Tsel | 
Bbvl | III Mwol | j 
I I III I II 

CACTGCTGCTACTGCTGCTAGCGATATTTATATCGATGCGCTTCTCACTTCTCCAGTACA 

1741 + + + + + + 1800 

GTGACGACGATGACGACGATCGCTATAAATATAGCTACGCGAAGAGTGAAGAGGTCATGT 

Mnll 

Sthl32I BscGl| BsmFI Bgll 

Hpyl78III | BslI || CjePI CviJI | Mwol 

I I I II I II I 

AAC T C C AGAAC C T C ATTACGGGTAT CAGGGACATTGGGAAG C CACTTGGG C AG AC ACAT C 

1801 + + + + + + i860 

TTGAGGTCTTGGAGTAATGCCCATAGTCCCTGTAACCCTTCGGTGAACCCGTCTGTGTAG 

BpulOI 

Hpyl78III Ddel 
CviRI | Maelll CviJI Hpyl78III 

CjePI | j Sthl32l| BscGI (BseMII | 

III II III I 
AACTGCAAAATCAGGAACTATGACTTGGGTAACTACGGGCTACAACCCTAATCCTGAGCG 

1861 + + + + + + 1920 

TTGACGTTTTAGTCCTTGATACTGAACCCATTGATGCCCGATGTTGGGATTAGGACTCGC 

Sthl32I 
Hinfl 
Tf il 
Pokl | 

Alul Hpyl78IIl|| | CjePI Bce83I 

CviJI CjePI II) | SfaNI | Sfcl | 

I I III I II II 

TAGAGCTTCCGTAGTTCC CGATTCATTATGG GCATC CTTTACTGACATTCG C ACTCTACA 

1921 + + + + + + 1980 

ATCTCGAAGGCATCAAGGGCTAAGTAATACCCGTAGGAAATGACTGTAAGCGTGAGATGT 

Nlalll 
Hpyl78III | 
Real | I 
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Dpnl | | | Plel 
Sau3AI M | j Smll Mull | Hinf I Hpyl78III 

I II I I l ill i 

GCAGATCATGACATCTCAAGCGAATAGTATCTATCAGCAACGAGGACTCTGGGCATCAGG 

1981 + + + + + + 2040 

CGTCTAGTACTGTAGAGTTCGCTTATCATAGATAGTCGTTGCTC CTGAGACC CGTAGTCC 

Apol Alul 
Tsp509I CviJI 

Mboii | Tthmii I 

AlwNI | j Hpyl88IX | | 

SfaNI j |XmnI Hpyl78III BsmI | Mmel j | 

I I I I I I I I I I 

AACTG C G AATTT CTT C C AT AAGGATAAAT CAG G AACTAAC CAAG CATT C CG AC AT AAAAG 

2041 + + + + + + 2100 

TTGACGCTTAAAGAAGGTATTCCTATTTAGTCCTTGATTGGTTCGTAAGGCTGTATTTTC 

MboII 

MboII | Alul 

Mmel Hpyl88IX| j CviJI 

CviJI Mnll | Beef I Eco57I ||| Eco57I TspRI | 

I I I I I III I II 

01 CTACGGCTATATTGTTGGAGGAAGTGCTGAAGATTTTTCTGAAAATATCTTCAGTGTAGC 

□ 2101 + + --- + + + + 2160 

G ATG C CGATAT AAC AAC CT C C TT CACG ACTT CT AAAAAG AC TTTTATAGAAGTCA CATCG 

|== 

Q Alul 

CviJI Acelll 
Cac8I | Earl | 
MboII | j Sapl | Hgal 

I I I I I I 

TTTCTGCCAGCTCTTCGGTAAAGATAAAGACCTGTTTATAGTTGAAAATACCTCTCATAA 

2161 + + + + + + 2220 

AAAGACGGTCGAGAAGCCATTTCTATTTCTGGACAAATATCAACTTTTATGGAGAGTATT 



Bfal 
Avrll | 
BsaJI j 
Styl| 

CviRl TagI Mnll | | 
Mnll Mwol |BspMl|BsmI | || Nlalll 

I I I II I I II I 

CTATTTAGCGTCGCTATACCTGCAACATCGAGCATTCCTAGGAGGACTTCCCATGCCCTC 

2221 + + + + + + 2280 

GATAAATCGCAGCGATATGGACGTTGTAGCTCGTAAGGATCCTCCTGAAGGGTACGGGAG 

Alul 
CviJI 
Bpull02I 
Ddel 

Mnll Alul | 

HphI | Nlalll CviJI | 

BslI | j MslI Nspl Mnll BsmI | j 

III II I I II 
ATTTGGAAGTATCACCGAC^TGCTGAAAGATATTCCTOTCATTTTGAATGCCCAGCTAAG 
2281 + + + + + + 2340 
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TAAACCTTCATAGTGGCTGTACGACTTTCTATAAGGAGAGTAAAACTTACGGGTCGATTC 



Alul 
CviJI 
Sfcl | 



2341 



Smll 

Hpyl78III Alul | 
BciVI Bce83I | CviJI j CviJI 

I I I III I 

CTAC AGCTACACTAAAAATGATATGGATACTCGCTATACTTC CTATCCTG AAGCT CAAGG 

+ + + + + + 

GATGTCGATGTGATTTTTACTATACCTATGAGCGATATGAAGGATAGGACTTCGAGTTCC 

Dpnl 
BstYI 
Sau3AI 
Hpyl88IX 

Banll 
BsiHKAI 
Bspl286I 
SacI 
Alul 
CviJI 
Mnll 
CjePI 
Hpyl78III 
Bfal 
Banll | 

Eco57I Bspl286l| 
Avail | Xbal|| | | || || CviJI 

Sau96I CviJI III j | || || Alwl| CjePI 

|| I III I I II I I II I 

CTCTTGGACCAATAACTCTGGGGCTCTAGAGCTCGGAGGATCTCTGGCTCTATATCTCCC 



2400 



2401 



- + 2460 



GAGAACCTGGTTATTGAGAC CC CGAGATCTC GAGC CTCCTAGAGACCGAGATATAGAGGG 



2461 



ScrFI 
BsaJI | 

EcoRII | | ScrFI Sfcl 

MboII Taal ||| Msel EcoRII | AccI | CviJI 

I I III I I I II I 

TAAAGAAGCACCGTTCTTCCAGGGATATTTCCCCTTCTTAAAGTTCCAGGCAGTCTACAG 

+ + + + + + 

ATTTCTTCGTGGCAAGAAGGTCCCTATAAAGGGGAAGAATTTCJ^AGGTCCGTCAGATGTC 



2520 



Acil 
Fnu4HI 
Taul 



2521 



Bed 

Sthl32I Eco57l| 
Dral Haell BscGI | Bsal | j Bfal 

Msel | HhaljCviJI | |BsmAI || BsaXI | 

| II II M I I M M 

CCG C CAACAAAACTTTAAAGAGAGTGGCGCTGAAGC CCGTGCTTTTGATGATGGAGAC CT 

+ + + + + + 2580 

GGCGGTTGTTTTGAAATTTCTCTCACCGCGACTTCGGGCACGAAAACTACTACCTCTGGA 

BsmI Hpyl88IX MboII 

I l I 

AGTGAACTGCTCTATCCCTGTCGGCATTCGGTTAGAAAAAATCTCCGAAGATGAAAAAAA 



2581 



2640 



TCACTTGACGAGATAGGGACAGCCGTAAGCCAATCTTTTTTAGAGGCTTCTACTTTTTTT 

44 
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Hpyl78III CviJI Sthl32I 

Tsp509I TaqI I Bfal | BsaBI HphI BscGI Thai 

III ll II II 

TAATTTCGAGATTTCTCTAGCCTACATTGGTGATGTGTATCGTAAAAATCCCCGTTCGCG 

2641 + + + + + + 2700 

ATTAAAGCTCTAAAGAGATCGGATGTAACCACTACACATAGC^TTTTTAGGGGCAAGCGC 

CviJI 

NlaIV| Mull 
Rsal TspRlj Mnll Mwol 

I II I I 

TACTTCTCTAATGGTCAGTGGAGCCTCTTGGACTTCGCTATGTAAAAACCTCGCACGACA 

2701 + + + + + + 2760 

ATG AAGAG ATT AC CAGT CAC CT C GG AG AAC CTGAAG CG AT ACATTTTTGG AGCGTG C T GT 

Hinfl 
Hpyl88IX | 
Bed | | Bbvl 
Mwol Plel| | I Mnll | 

CviJI Ddel | CviJI || | | Nlalll | | 

| II III I I I II 

AGCCTTCTTAGCAAGTGCTGGAAGCCATCTGACTCTCTCCCCTC 

2761 + + + + + + 2820 

T C GG AAGAATCGTT CACGACCTTC GGTAGACTGAGAGAGGGGAGTACAT CTTGAGAGAC C 



EI1U4HI 

+ Alul | Bpull02I 

ilJ CviJI | Alul Ddel RleAI Ddel 

S Tsel CviJI CviJI | BseMII | CviJI | 

|| I M M II 

U= GGAAGC TG CTTATGAG CTTCGTGGCT CAG CACACAT CTACAATGTAG ATTGTGGGCTAAG 

ry 282i + + + + + + 2880 

O COTTCGACGAATACTCGAAGCACCGAGTCGTGTGTAGATGTTACATCTAACACCCGATTC 

Mnll BcoRI 
Bfal Mnll | Tsp509I 

I n I 

ATACTCATTCTAGTTCCTACTTTCCTCCCTAAACTTTTAGGGAGGAATTCTTATAAAAAC 

2881 - + + + + + 2940 

T ATG AGTAAG ATCAAGG ATG AAAGG AGGGATTTGAATVATC C CTC CTTAAG AATATTTT TG 

Tsp509I 

Hinfl Bfal Msel | 

Sfcl Tfil Msel Spel| BsmAI Mnll ]| CviRI 

I I I II I I M I 

CCTGTAGATTCTTAACrTACTAGTCTCrCCTTTCCTCTTG^ 

2941 + + + + + + 3000 

GGACATCT AAGAATTGAATG AT CAGAGAGGAAAGGAGAACGAAAG AAATT AAAT AAC GT C 
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